mentat

Author	SHA1	Message	Date
Greg Burd	4f81c4e15b	Attempting to cleanup with clippy, rustfmt, etc. Integrate https://github.com/mozilla/mentat/pull/806	2020-01-31 10:55:45 -05:00
Greg Burd	b2f92b8461	Update to 2018 edition of Rust (1.42). Fix and format code. Update dependencies. Fix tests.	2020-01-16 10:58:21 -05:00
Grisha Kruglov	9fd198f96a	Pre: Move ValueTypeSet into core-traits	2018-08-09 13:16:05 -07:00
Grisha Kruglov	07beb68c7a	Pre: Remove query/ crate	2018-08-09 13:16:05 -07:00
Grisha Kruglov	d0214fad7d	Pre: Move core/types.rs into core_traits	2018-08-09 13:16:05 -07:00
Grisha Kruglov	a57ba5d79f	Pre: Move Entid and KnownEntid into core_traits	2018-08-09 13:16:05 -07:00
Nick Alexander	cfed968514	Review comments.	2018-06-04 15:21:27 -07:00
Nick Alexander	47441f56dc	Part 5: Push FindQuery into query-algebrizer; structure errors. This is a big deck-chair re-arrangement. This puts FindQuery into query-algebrizer and puts the validation from ParsedFindQuery -> FindQuery their as well. Some tests were re-homed for this. In addition, the little-used maplit crate dependency was replaced with inline expressions.	2018-06-04 15:04:39 -07:00
Nick Alexander	d8d18a1731	[query] Handle SQL NULL for aggregates over 0 rows. (#684 ) (#688 ) r=rnewman This uses a `SELECT *` from an inner subselect to filter potentially `NULL` aggregates. The alternative is to handle `NULL` values throughout the projector, which is simple but loses a valuable invariant: Mentat SQL queries produce values that are not `NULL`.	2018-06-01 14:17:31 -07:00
Richard Newman	3dc68bcd38	Combine NamespacedKeyword and Keyword. (#689 ) r=nalexander * Make properties on NamespacedKeyword/NamespacedSymbol private * Use only a single String for NamespacedKeyword/NamespacedSymbol * Review comments. * Remove unsafe code in namespaced_name. Benchmarking shows approximately zero change. * Allow the types of ns and name to differ when constructing a NamespacedName. * Make symbol namespaces optional. * Normalize names of keyword/symbol constructors. This will make the subsequent refactor much less painful. * Use expect not unwrap. * Merge Keyword and NamespacedKeyword.	2018-05-11 09:52:17 -07:00
Richard Newman	f979044ba1	Refactor value type boxing. (#659 ) r=nalexander * Pre: eliminate some occurrences of Rc, largely through the magic of Into. * Pre: introduce FromRc to convert between refcounted types. * Introduce ValueRc as an abstraction over Rc/Arc choice. * Move Cloned to core. * Move CString-creation methods to TypedValue. * Finish transition.	2018-04-25 14:23:27 -07:00
Nick Alexander	c8da4be38f	(query) Implement tx-log API: (tx-ids ...) and (tx-data ...) functions. `tx-ids` allows to enumerate transaction IDs efficiently. `tx-data` allows to extract transaction log data efficiently. We might eventually allow to filter by impacted attribute sets as well.	2018-04-19 09:58:41 -07:00
Nick Alexander	e532614908	(query) Pre: Model columns that don't have type tags closer to Column.	2018-04-19 09:58:41 -07:00
Richard Newman	a57f7aff99	Add specialized tx-before and tx-after predicates. (#599 ) r=emily	2018-04-05 10:49:06 -07:00
Richard Newman	833ff92436	Simple aggregates. (#584 ) r=emily * Pre: use debugcli in VSCode. * Pre: wrap subqueries in parentheses in output SQL. * Pre: add ExistingColumn. This lets us make reference to columns by name, rather than only pointing to qualified aliases. * Pre: add Into for &str to TypedValue. * Pre: add Store.transact. * Pre: cleanup. * Parse and algebrize simple aggregates. (#312) * Follow-up: print aggregate columns more neatly in the CLI. * Useful ValueTypeSet helpers. * Allow for entity inequalities. * Add 'differ', which is a ref-specialized not-equals. * Add 'unpermute', a function for getting unique, distinct pairs from bindings. * Review comments. * Add 'the' pseudo-aggregation operator. This allows for a corresponding value to be returned when a query includes one 'min' or 'max' aggregate.	2018-03-12 15:18:50 -07:00
Richard Newman	1817ce7c0b	Performance and cleanup. r=emily * Use fixed-size arrays for bootstrap datoms, not vecs. * Wide-ranging cleanup. This commit: - Deletes some dead code. - Marks some functions only used by tests as cfg(test). - Adds pub(crate) to a bunch of functions. - Cleans up a few other nits.	2018-03-06 09:03:00 -08:00
Richard Newman	e33fe71c47	Rework caching and use it inside the query engine. (#553 ) r=emily This puts caching in mentat_db, adds a reverse lookup capability for unique attributes, and populates bidirectional caches with a single SQL cursor walk. Differentiate between begin_read and begin_uncached_read. Note that we still allow toggling within InProgress, because there might be transient local state that makes starting a new transaction impossible.	2018-02-21 11:51:45 -08:00
Thom Chiovoloni	98502eb68f	Implement type annotations in queries. (#526 ) r=rnewman	2018-01-29 14:37:53 -08:00
Richard Newman	df90c366af	Partial work from simple aggregates work (#497 ) r=nalexander * Pre: make FindQuery, FindSpec, and Element non-Clone. * Pre: make query translator return a Result. * Pre: make projection return a Result. * Pre: refactor query parser in preparation for parsing aggregates. * Pre: rename PredicateFn -> QueryFunction. * Pre: expose more about bound variables from CC. * Pre: move ValueTypeSet to core.	2017-11-30 15:02:07 -08:00
Richard Newman	eaf3e7fc4b	Extend inequalities to Instants. (#439 ) r=fluffyemily,nalexander	2017-06-16 11:57:44 -07:00
Richard Newman	565a0e9ff9	Implement MATCHES throughout SQL machinery.	2017-06-15 10:28:10 -07:00
Richard Newman	03c0930285	Pre: implement IntoIterator for ValueTypeSet.	2017-06-15 10:27:51 -07:00
Nick Alexander	79fa0994b3	Part 3: Handle `ground`. (#469 ) r=nalexander,rnewman This version removes nalexander's lovely matrix code. It turned out that scalar and tuple bindings are sufficiently different from coll and rel -- they can directly apply as values in the query -- that there was no point in jumping through hoops to turn those single values into a matrix. Furthermore, I've standardized us on a Vec<TypedValue> representation for rectangular matrices, which should be much more efficient, but would have required rewriting that code. Finally, coll and rel are sufficiently different from each other -- coll doesn't require processing nested collections -- that my attempts to share code between them fell somewhat flat. I had lots of nice ideas about zipping together cycles and such, but ultimately I ended up with relatively straightforward, if a bit repetitive, code. The next commit will demonstrate the value of this work -- tests that exercised scalar and tuple grounding now collapse down to the simplest possible SQL.	2017-06-09 20:18:31 -07:00
Richard Newman	4a886aae17	Pre: derive Debug.	2017-06-09 20:16:38 -07:00
Richard Newman	a10c6fc67a	Pre: make ValueTypeSet Copy, as it only newtypes EnumSet, which is Copy.	2017-06-09 20:16:36 -07:00
Richard Newman	dbbbd220f9	Pre: add helpers to ValueTypeSet.	2017-06-09 20:16:35 -07:00
Emily Toop	bd389d2f0d	Parse and Algebrize `not` & `not-join`. (#302 ) (Closes #303 , #389 , #422 ) r=rnewman * Part 1 - Parse `not` and `not-join` * Part 2 - Validate `not` and `not-join` pre-algebrization * Address review comments rnewman. * Remove `WhereNotClause` and populate `NotJoin` with `WhereClause`. * Fix validation for `not` and `not-join`, removing tests that were invalid. * Address rustification comments. * Rebase against `rust` branch. * Part 3 - Add required types for NotJoin. * Implement `PartialEq` for `ConjoiningClauses` so `ComputedTable` can be included inside `ColumnConstraint::NotExists` * Part 4 - Implement `apply_not_join` * Part 5 - Call `apply_not_join` from inside `apply_clause` * Part 6 - Translate `not-join` into `NOT EXISTS` SQL * Address review comments. * Rename `projected` to `unified` to better describe the fact that we are not projecting any variables. * Check for presence of each unified var in either `column_bindings` or `input_bindings` and bail if not there. * Copy over `input_bindings` for each var in `unified`. * Only copy over the first `column_binding` for each variable in `unified` rather than the whole list. * Update tests. * Address review comments. * Make output from Debug for NotExists more useful * Clear up misunderstanding. Any single failing clause in the not will cause the entire not to be considered empty * Address review comments. * Remove Limit requirement from cc_to_exists. * Use Entry.or_insert instead of matching on the entry to add to column_bindings. * Move addition of value_bindings to before apply_clauses on template. * Tidy up tests with some variable reuse. * Addressed nits, * Address review comments. * Move addition of column_bindings to above apply_clause. * Update tests. * Add test to ensure that unbound vars fail * Improve test for unbound variable to check for correct variable and error * address nits	2017-04-28 10:44:11 +01:00
Richard Newman	19fc7cddf1	[query] Widen `known_types` correctly in complex `or`. (#424 ) r=nalexander * Part 1: define ValueTypeSet. We're going to use this instead of `HashSet<ValueType>` so that we can clearly express the empty set and the set of all types, and also to encapsulate a switch to `EnumSet`." * Part 2: use ValueTypeSet. * Part 3: fix type expansion. * Part 4: add a test for type extraction from nested `or`. * Review comments. * Review comments: simplify ValueTypeSet.	2017-04-24 14:15:26 -07:00
Richard Newman	35d73d5541	Implement :order. (#415 ) (#416 ) r=nalexander This adds an `:order` keyword to `:find`. If present, the results of the query will be an ordered set, rather than an unordered set; rows will appear in an ordered defined by each `:order` entry. Each can be one of three things: - A var, `?x`, meaning "order by ?x ascending". - A pair, `(asc ?x)`, meaning "order by ?x ascending". - A pair, `(desc ?x)`, meaning "order by ?x descending". Values will be ordered in this sequence for asc, and in reverse for desc: 1. Entity IDs, in ascending numerical order. 2. Booleans, false then true. 3. Timestamps, in ascending numerical order. 4. Longs and doubles, intermixed, in ascending numerical order. 5. Strings, in ascending lexicographic order. 6. Keywords, in ascending lexicographic order, considering the entire ns/name pair as a single string separated by '/'. Subcommits: Pre: make bound_value public. Pre: generalize ErrorKind::UnboundVariable for use in order. Part 1: parse (direction, var) pairs. Part 2: parse :order clause into FindQuery. Part 3: include order variables in algebrized query. We add order variables to :with, so we can reuse its type tag projection logic, and so that we can phrase ordering in terms of variables rather than datoms columns. Part 4: produce SQL for order clauses.	2017-04-17 11:30:31 -07:00
Richard Newman	08d2c613a4	Part 2: expand the definition of a table to include computed tables. This commit: - Defines a new kind of column, distinct from the eavt columns in `DatomsColumn`, to model the rows projected from subqueries. These always name one of two things: a variable, or a variable's type tag. Naturally the two cases are thus `Variable` and `VariableTypeTag`. These are cheap to clone, given that `Variable` is an `Rc<String>`. - Defines `Column` as a wrapper around `DatomsColumn` and `VariableColumn`. Everywhere we used to use `DatomsColumn` we now allow `Column`: particularly in constraints and projections. - Broadens the definition of a table list in the intermediate "query-sql" representation to include a SQL UNION. A UNION is represented as a list of queries and an alias. - Implements translation from a `ComputedTable` to the query-sql representation. In this commit we only project vars, not type tags. Review comment: discuss bind_column_to_var for ValueTypeTag. Review comment: implement From<Vec<T>> for ConsumableVec<T>.	2017-04-12 19:21:33 -07:00
Richard Newman	7948788936	Part 1: define ComputedTable. Complex `or`s are translated to SQL as a subquery -- in particular, a subquery that's a UNION. Conceptually, that subquery is a computed table: `all_datoms` and `datoms` yield rows of e/a/v/tx, and each computed table yields rows of variable bindings. The table itself is a type, `ComputedTable`. Its `Union` case contains everything a subquery needs: a `ConjoiningClauses` and a projection list, which together allow us to build a SQL subquery, and a list of variables that need type code extraction. (This is discussed further in a later commit.) Naturally we also need a way to refer to columns in a computed table. We model this by a new enum case in `DatomsTable`, `Computed`, which maintains an integer value that uniquely identifies a computed table.	2017-04-12 11:13:58 -07:00
Richard Newman	0639c94468	Part 2: implement simple `or`.	2017-04-07 12:46:25 -07:00
Richard Newman	74f188df9b	Part 5b: rename also/instead to add_intersection and add_alternate.	2017-03-30 19:13:20 -07:00
Richard Newman	9e5c735460	Part 5: split cc.rs into a 'clauses' module. mod.rs defines the module and ConjoiningClauses itself, complete with methods to record facts and ask it questions. pattern.rs, predicate.rs, resolve.rs, and or.rs include particular functionality around accumulating certain kinds of patterns. Only `or.rs` includes significant new code; the rest is just split.	2017-03-30 19:13:20 -07:00
Richard Newman	01ca0ae5c1	Part 2: add an EmptyBecause case for fulltext/non-string type mismatch.	2017-03-30 19:13:19 -07:00
Richard Newman	997df0b776	Part 1: introduce ColumnIntersection and ColumnAlternation. This provides a limited form of OR and AND for column constraints, allowing simple 'or-join' queries to be expressed on a single table alias.	2017-03-30 19:13:19 -07:00
Richard Newman	95a5326e23	Pre: move EmptyBecause into types.rs.	2017-03-30 18:03:03 -07:00
Richard Newman	97749833d0	Algebrize and translate numeric constraints. (#306 ) r=nalexander	2017-03-22 10:19:47 -07:00
Richard Newman	3d66cb5d0f	Pre: move query algebrizer types to their own file.	2017-03-22 10:13:45 -07:00

39 commits