mentat

Author	SHA1	Message	Date
Nick Alexander	cfed968514	Review comments.	2018-06-04 15:21:27 -07:00
Nick Alexander	47441f56dc	Part 5: Push FindQuery into query-algebrizer; structure errors. This is a big deck-chair re-arrangement. This puts FindQuery into query-algebrizer and puts the validation from ParsedFindQuery -> FindQuery their as well. Some tests were re-homed for this. In addition, the little-used maplit crate dependency was replaced with inline expressions.	2018-06-04 15:04:39 -07:00
Richard Newman	e21156a754	Implement simple pull expressions (#638 ) r=nalexander * Refactor AttributeCache populator code for use from pull. * Pre: add to_value_rc to Cloned. * Pre: add From<StructuredMap> for Binding. * Pre: clarify Store::open_empty. * Pre: StructuredMap cleanup. * Pre: clean up a doc test. * Split projector crate. Pass schema to projector. * CLI support for printing bindings. * Add and use ConjoiningClauses::derive_types_from_find_spec. * Define pull types. * Implement pull on top of the attribute cache layer. * Add pull support to the projector. * Parse pull expressions. * Add simple pull support to connection objects. * Tests for pull. * Compile with Rust 1.25. The only choice involved in this commit is that of replacing the anonymous lifetime '_ with a named lifetime for the cache; since we're accepting a Known, which includes the cache in question, I think it's clear that we expect the function to apply to any given cache lifetime. * Review comments. * Bail on unnamed attribute. * Make assert_parse_failure_contains safe to use. * Rework query parser to report better errors for pull. * Test for mixed wildcard and simple attribute.	2018-05-04 12:56:00 -07:00
Richard Newman	833ff92436	Simple aggregates. (#584 ) r=emily * Pre: use debugcli in VSCode. * Pre: wrap subqueries in parentheses in output SQL. * Pre: add ExistingColumn. This lets us make reference to columns by name, rather than only pointing to qualified aliases. * Pre: add Into for &str to TypedValue. * Pre: add Store.transact. * Pre: cleanup. * Parse and algebrize simple aggregates. (#312) * Follow-up: print aggregate columns more neatly in the CLI. * Useful ValueTypeSet helpers. * Allow for entity inequalities. * Add 'differ', which is a ref-specialized not-equals. * Add 'unpermute', a function for getting unique, distinct pairs from bindings. * Review comments. * Add 'the' pseudo-aggregation operator. This allows for a corresponding value to be returned when a query includes one 'min' or 'max' aggregate.	2018-03-12 15:18:50 -07:00
Richard Newman	1817ce7c0b	Performance and cleanup. r=emily * Use fixed-size arrays for bootstrap datoms, not vecs. * Wide-ranging cleanup. This commit: - Deletes some dead code. - Marks some functions only used by tests as cfg(test). - Adds pub(crate) to a bunch of functions. - Cleans up a few other nits.	2018-03-06 09:03:00 -08:00
Richard Newman	f42ae35b70	Update cache on write. (#566 ) r=emily * Use the cache to make constant queries super fast. * Fix translate tests to match: we no longer generate SQL for many of them! * Accumulate additions and removals into the cache. * Make attribute cache clone-on-write; store it in Metadata. * Allow caching of fulltext attributes, interning strings.	2018-03-06 09:01:20 -08:00
Richard Newman	e33fe71c47	Rework caching and use it inside the query engine. (#553 ) r=emily This puts caching in mentat_db, adds a reverse lookup capability for unique attributes, and populates bidirectional caches with a single SQL cursor walk. Differentiate between begin_read and begin_uncached_read. Note that we still allow toggling within InProgress, because there might be transient local state that makes starting a new transaction impossible.	2018-02-21 11:51:45 -08:00
Richard Newman	66e6fef75e	Define Store, use TabWriter in the CLI for aligning columnar output. (#540 ) r=emily * Define Store, which is a simple container for a SQLite connection and a Conn. This is a breaking change. * Return the FindSpec as part of QueryOutput, not just results. * Switch to using stderr in appropriate places in CLI. * Print columns in CLI output.	2018-02-01 09:29:07 -08:00
Thom Chiovoloni	98502eb68f	Implement type annotations in queries. (#526 ) r=rnewman	2018-01-29 14:37:53 -08:00
Richard Newman	df90c366af	Partial work from simple aggregates work (#497 ) r=nalexander * Pre: make FindQuery, FindSpec, and Element non-Clone. * Pre: make query translator return a Result. * Pre: make projection return a Result. * Pre: refactor query parser in preparation for parsing aggregates. * Pre: rename PredicateFn -> QueryFunction. * Pre: expose more about bound variables from CC. * Pre: move ValueTypeSet to core.	2017-11-30 15:02:07 -08:00
Richard Newman	eaf3e7fc4b	Extend inequalities to Instants. (#439 ) r=fluffyemily,nalexander	2017-06-16 11:57:44 -07:00
Richard Newman	565a0e9ff9	Implement MATCHES throughout SQL machinery.	2017-06-15 10:28:10 -07:00
Nick Alexander	79fa0994b3	Part 3: Handle `ground`. (#469 ) r=nalexander,rnewman This version removes nalexander's lovely matrix code. It turned out that scalar and tuple bindings are sufficiently different from coll and rel -- they can directly apply as values in the query -- that there was no point in jumping through hoops to turn those single values into a matrix. Furthermore, I've standardized us on a Vec<TypedValue> representation for rectangular matrices, which should be much more efficient, but would have required rewriting that code. Finally, coll and rel are sufficiently different from each other -- coll doesn't require processing nested collections -- that my attempts to share code between them fell somewhat flat. I had lots of nice ideas about zipping together cycles and such, but ultimately I ended up with relatively straightforward, if a bit repetitive, code. The next commit will demonstrate the value of this work -- tests that exercised scalar and tuple grounding now collapse down to the simplest possible SQL.	2017-06-09 20:18:31 -07:00
Richard Newman	c6e933c396	Pre: make rule_vars return unique vars.	2017-06-09 20:16:39 -07:00
Richard Newman	4a886aae17	Pre: derive Debug.	2017-06-09 20:16:38 -07:00
Richard Newman	19fc7cddf1	[query] Widen `known_types` correctly in complex `or`. (#424 ) r=nalexander * Part 1: define ValueTypeSet. We're going to use this instead of `HashSet<ValueType>` so that we can clearly express the empty set and the set of all types, and also to encapsulate a switch to `EnumSet`." * Part 2: use ValueTypeSet. * Part 3: fix type expansion. * Part 4: add a test for type extraction from nested `or`. * Review comments. * Review comments: simplify ValueTypeSet.	2017-04-24 14:15:26 -07:00
Richard Newman	bc63744aba	Add :limit to queries (#420 ) r=nalexander * Pre: put query parts in alphabetical order. * Pre: rename 'input' to 'query' in translate tests. * Part 1: parse :limit. * Part 2: validate and escape variable parameters in SQL. * Part 3: algebrize and translate limits.	2017-04-19 16:16:19 -07:00
Richard Newman	60c082b61e	Part 4: pass inputs through algebrizing and execution. (#418 ) This also adds a test that an `UnboundVariables` error is raised if a variable mentioned in the `:in` clause isn't bound.	2017-04-18 13:19:50 -07:00
Richard Newman	35d73d5541	Implement :order. (#415 ) (#416 ) r=nalexander This adds an `:order` keyword to `:find`. If present, the results of the query will be an ordered set, rather than an unordered set; rows will appear in an ordered defined by each `:order` entry. Each can be one of three things: - A var, `?x`, meaning "order by ?x ascending". - A pair, `(asc ?x)`, meaning "order by ?x ascending". - A pair, `(desc ?x)`, meaning "order by ?x descending". Values will be ordered in this sequence for asc, and in reverse for desc: 1. Entity IDs, in ascending numerical order. 2. Booleans, false then true. 3. Timestamps, in ascending numerical order. 4. Longs and doubles, intermixed, in ascending numerical order. 5. Strings, in ascending lexicographic order. 6. Keywords, in ascending lexicographic order, considering the entire ns/name pair as a single string separated by '/'. Subcommits: Pre: make bound_value public. Pre: generalize ErrorKind::UnboundVariable for use in order. Part 1: parse (direction, var) pairs. Part 2: parse :order clause into FindQuery. Part 3: include order variables in algebrized query. We add order variables to :with, so we can reuse its type tag projection logic, and so that we can phrase ordering in terms of variables rather than datoms columns. Part 4: produce SQL for order clauses.	2017-04-17 11:30:31 -07:00
Richard Newman	64acc6a7ee	Support :with (#311 ) (#414 ) r=nalexander * Pre: refactor projector code. * Part 1: maintain 'with' variables in AlgebrizedQuery. * Part 2: include necessary 'with' variables in SQL projection list. The test produces projection elements for `:with`, even though there are no aggregates in the query. This test will need to be adjusted when we optimize this away!	2017-04-17 09:23:55 -07:00
Richard Newman	08d2c613a4	Part 2: expand the definition of a table to include computed tables. This commit: - Defines a new kind of column, distinct from the eavt columns in `DatomsColumn`, to model the rows projected from subqueries. These always name one of two things: a variable, or a variable's type tag. Naturally the two cases are thus `Variable` and `VariableTypeTag`. These are cheap to clone, given that `Variable` is an `Rc<String>`. - Defines `Column` as a wrapper around `DatomsColumn` and `VariableColumn`. Everywhere we used to use `DatomsColumn` we now allow `Column`: particularly in constraints and projections. - Broadens the definition of a table list in the intermediate "query-sql" representation to include a SQL UNION. A UNION is represented as a list of queries and an alias. - Implements translation from a `ComputedTable` to the query-sql representation. In this commit we only project vars, not type tags. Review comment: discuss bind_column_to_var for ValueTypeTag. Review comment: implement From<Vec<T>> for ConsumableVec<T>.	2017-04-12 19:21:33 -07:00
Richard Newman	7948788936	Part 1: define ComputedTable. Complex `or`s are translated to SQL as a subquery -- in particular, a subquery that's a UNION. Conceptually, that subquery is a computed table: `all_datoms` and `datoms` yield rows of e/a/v/tx, and each computed table yields rows of variable bindings. The table itself is a type, `ComputedTable`. Its `Union` case contains everything a subquery needs: a `ConjoiningClauses` and a projection list, which together allow us to build a SQL subquery, and a list of variables that need type code extraction. (This is discussed further in a later commit.) Naturally we also need a way to refer to columns in a computed table. We model this by a new enum case in `DatomsTable`, `Computed`, which maintains an integer value that uniquely identifies a computed table.	2017-04-12 11:13:58 -07:00
Richard Newman	98ac559894	Pre: allow initialization of a CC with an arbitrary counter value. Useful for testing.	2017-04-12 11:12:48 -07:00
Richard Newman	b693385495	Part 5: eliminate is_known_empty in favor of empty_because and an accessor.	2017-04-07 12:46:26 -07:00
Richard Newman	72977f52e4	Part 3: reinstate extracted type pruning. When we started expanding and narrowing type sets, it became impossible to conclusively know during pattern application whether a type was known. We now figure that out at the end: if a variable has only a single known type, we don't need to extract its type tag.	2017-04-07 12:46:26 -07:00
Richard Newman	0639c94468	Part 2: implement simple `or`.	2017-04-07 12:46:25 -07:00
Richard Newman	9e5c735460	Part 5: split cc.rs into a 'clauses' module. mod.rs defines the module and ConjoiningClauses itself, complete with methods to record facts and ask it questions. pattern.rs, predicate.rs, resolve.rs, and or.rs include particular functionality around accumulating certain kinds of patterns. Only `or.rs` includes significant new code; the rest is just split.	2017-03-30 19:13:20 -07:00
Richard Newman	997df0b776	Part 1: introduce ColumnIntersection and ColumnAlternation. This provides a limited form of OR and AND for column constraints, allowing simple 'or-join' queries to be expressed on a single table alias.	2017-03-30 19:13:19 -07:00
Richard Newman	8adb6d97fd	Add validation for or-join. r=nalexander	2017-03-27 16:32:45 -07:00
Richard Newman	88df7b3b33	Correctly generate DISTINCT and LIMIT. (#386 ) r=nalexander	2017-03-22 14:02:00 -07:00
Richard Newman	97749833d0	Algebrize and translate numeric constraints. (#306 ) r=nalexander	2017-03-22 10:19:47 -07:00
Richard Newman	1c4e30a906	Pre: switch to taking Patterns by move, not by reference, when algebrizing.	2017-03-22 10:14:15 -07:00
Richard Newman	f5aa6b2c2c	Pre: add mentat_query_algebrizer::errors.	2017-03-22 10:14:15 -07:00
Richard Newman	3d66cb5d0f	Pre: move query algebrizer types to their own file.	2017-03-22 10:13:45 -07:00
Richard Newman	bf38105fef	(#362 ) Part 4: handle unknown attributes by expanding type codes. r=nalexander Also, don't run any SQL at all if an algebrized query is known to return no results.	2017-03-08 17:44:27 -08:00
Richard Newman	e898df8842	Implement basic query limits. (#361 ) r=nalexander	2017-03-08 17:41:42 -08:00
Richard Newman	70b112801c	Implement projection and querying. (#353 ) r=nalexander * Add a failing test for EDN parsing '…'. * Expose a SQLValueType trait to get value_type_tag values out of a ValueType. * Add accessors to FindSpec. * Implement querying. * Implement rudimentary projection. * Export mentat_db::new_connection. * Export symbols from mentat. * Add rudimentary end-to-end query tests.	2017-03-06 14:40:10 -08:00
Richard Newman	91c75f26c8	Expand query algebrizer. r=nalexander	2017-02-23 18:39:49 -08:00
Richard Newman	a9cd9b1e87	Export symbols and string helpers from mentat_query_algebrizer.	2017-02-17 17:54:07 -08:00
Richard Newman	5af7082165	Partly flesh out query algebrizer. (#243 ) r=nalexander	2017-02-15 16:10:59 -08:00
Richard Newman	42f03f55a2	Stub out query algebrizer.	2017-02-15 16:01:22 -08:00

41 commits