mentat

Author	SHA1	Message	Date
Emily Toop	e1e7cbaa44	Closes #634 - Fix variables in predicates (#635 ) r=rnewman We were forgetting to check for bound variables when resolving types other than ref types during inequality handling. This patch adds in the binding checks and `bails` if the bound variable is of the wrong type. #634	2018-05-09 16:24:12 +01:00
Richard Newman	e21156a754	Implement simple pull expressions (#638 ) r=nalexander * Refactor AttributeCache populator code for use from pull. * Pre: add to_value_rc to Cloned. * Pre: add From<StructuredMap> for Binding. * Pre: clarify Store::open_empty. * Pre: StructuredMap cleanup. * Pre: clean up a doc test. * Split projector crate. Pass schema to projector. * CLI support for printing bindings. * Add and use ConjoiningClauses::derive_types_from_find_spec. * Define pull types. * Implement pull on top of the attribute cache layer. * Add pull support to the projector. * Parse pull expressions. * Add simple pull support to connection objects. * Tests for pull. * Compile with Rust 1.25. The only choice involved in this commit is that of replacing the anonymous lifetime '_ with a named lifetime for the cache; since we're accepting a Known, which includes the cache in question, I think it's clear that we expect the function to apply to any given cache lifetime. * Review comments. * Bail on unnamed attribute. * Make assert_parse_failure_contains safe to use. * Rework query parser to report better errors for pull. * Test for mixed wildcard and simple attribute.	2018-05-04 12:56:00 -07:00
Richard Newman	f979044ba1	Refactor value type boxing. (#659 ) r=nalexander * Pre: eliminate some occurrences of Rc, largely through the magic of Into. * Pre: introduce FromRc to convert between refcounted types. * Introduce ValueRc as an abstraction over Rc/Arc choice. * Move Cloned to core. * Move CString-creation methods to TypedValue. * Finish transition.	2018-04-25 14:23:27 -07:00
Nick Alexander	c8da4be38f	(query) Implement tx-log API: (tx-ids ...) and (tx-data ...) functions. `tx-ids` enumerates transaction IDs efficiently. `tx-data` extracts transaction log data efficiently. We might eventually allow filtering by impacted attribute sets as well.	2018-04-19 09:58:41 -07:00
Nick Alexander	e532614908	(query) Pre: Model columns that don't have type tags closer to Column.	2018-04-19 09:58:41 -07:00
Richard Newman	a57f7aff99	Add specialized tx-before and tx-after predicates. (#599 ) r=emily	2018-04-05 10:49:06 -07:00
Richard Newman	994a3e65e2	Tests and fixes for aggregates over different or unknown types. (#588 ) r=emily	2018-03-15 07:14:06 -07:00
Richard Newman	833ff92436	Simple aggregates. (#584 ) r=emily * Pre: use debugcli in VSCode. * Pre: wrap subqueries in parentheses in output SQL. * Pre: add ExistingColumn. This lets us make reference to columns by name, rather than only pointing to qualified aliases. * Pre: add Into for &str to TypedValue. * Pre: add Store.transact. * Pre: cleanup. * Parse and algebrize simple aggregates. (#312) * Follow-up: print aggregate columns more neatly in the CLI. * Useful ValueTypeSet helpers. * Allow for entity inequalities. * Add 'differ', which is a ref-specialized not-equals. * Add 'unpermute', a function for getting unique, distinct pairs from bindings. * Review comments. * Add 'the' pseudo-aggregation operator. This allows for a corresponding value to be returned when a query includes one 'min' or 'max' aggregate.	2018-03-12 15:18:50 -07:00
Richard Newman	1817ce7c0b	Performance and cleanup. r=emily * Use fixed-size arrays for bootstrap datoms, not vecs. * Wide-ranging cleanup. This commit: - Deletes some dead code. - Marks some functions only used by tests as cfg(test). - Adds pub(crate) to a bunch of functions. - Cleans up a few other nits.	2018-03-06 09:03:00 -08:00
Richard Newman	f42ae35b70	Update cache on write. (#566 ) r=emily * Use the cache to make constant queries super fast. * Fix translate tests to match: we no longer generate SQL for many of them! * Accumulate additions and removals into the cache. * Make attribute cache clone-on-write; store it in Metadata. * Allow caching of fulltext attributes, interning strings.	2018-03-06 09:01:20 -08:00
Richard Newman	d46535a7c2	When an attribute is known-fulltext, don't hit AllDatoms. (#576 ) r=nalexander	2018-03-05 10:09:53 -08:00
Richard Newman	54bd883c65	Follow-up: remove logging and such elsewhere in the codebase.	2018-02-21 11:51:45 -08:00
Richard Newman	e33fe71c47	Rework caching and use it inside the query engine. (#553 ) r=emily This puts caching in mentat_db, adds a reverse lookup capability for unique attributes, and populates bidirectional caches with a single SQL cursor walk. Differentiate between begin_read and begin_uncached_read. Note that we still allow toggling within InProgress, because there might be transient local state that makes starting a new transaction impossible.	2018-02-21 11:51:45 -08:00
Kit Cambridge	a6341f6fd6	Implement `q_prepare` with pre-bound variables. r=rnewman	2018-02-07 21:48:05 -08:00
Richard Newman	37a7c9ea48	Validate attributes installed after open. (#538 ) r=emily Make AttributeBuilder optionally helpful, fix tests.	2018-02-01 09:29:04 -08:00
Thom Chiovoloni	98502eb68f	Implement type annotations in queries. (#526 ) r=rnewman	2018-01-29 14:37:53 -08:00
Richard Newman	4acc6d0658	InProgressRead, KnownEntid. r=nalexander,emily Improve naming of read-only transactions. Implement entid_for_type. Simplify get_attribute. Name ignored var in algebrizer. Comment attribute_for_ident. Make KnownEntid a core concept. Expose lookup_value_for_attribute. Implement HasSchema and a new query encapsulation on Conn. Pre: export Queryable.	2018-01-23 08:40:18 -08:00
Richard Newman	6797a606b5	Preliminary work for vocabulary management. r=emily,nalexander Pre: export AttributeBuilder from mentat_db. Pre: fix module-level comment for tx/src/entities.rs. Pre: rename some `to_` conversions to `into_`. Pre: make AttributeBuilder::unique less verbose. Pre: split out a HasSchema trait to abstract over Schema. Pre: rename SchemaMap/schema_map to AttributeMap/attribute_map. Pre: TypedValue/NamespacedKeyword conversions. Pre: turn Unique and ValueType into TypedValue::Keyword. Pre: export IntoResult. Pre: export NamespacedKeyword from mentat_core. Pre: use intern_set in tx. Pre: add InternSet::len. Pre: comment gardening. Pre: remove inaccurate TODO from TxReport comment.	2018-01-23 08:25:32 -08:00
Thom	9740cafdbd	Automatically remove trailing whitespace from text files. (#527 ) r=rnewman This was done using the following shell script: ``` find . -type f -not -path "target" \ '(' -name '*.rs' -o -name '*.md' -o -name '*.toml' ')' -print0 | \ xargs -0 sed -i '' -E 's/[[:space:]]$//' ``` Which is admittedly imperfect, but manages to hit everything that was a problem in this repo.	2018-01-19 21:21:04 -06:00
Richard Newman	df90c366af	Partial work from simple aggregates work (#497 ) r=nalexander * Pre: make FindQuery, FindSpec, and Element non-Clone. * Pre: make query translator return a Result. * Pre: make projection return a Result. * Pre: refactor query parser in preparation for parsing aggregates. * Pre: rename PredicateFn -> QueryFunction. * Pre: expose more about bound variables from CC. * Pre: move ValueTypeSet to core.	2017-11-30 15:02:07 -08:00
Emily Toop	c15973f269	Support tx places in queries (#485 ) r=rnewman * Support tx places in queries	2017-06-28 18:20:16 +01:00
Richard Newman	eaf3e7fc4b	Extend inequalities to Instants. (#439 ) r=fluffyemily,nalexander	2017-06-16 11:57:44 -07:00
Richard Newman	20aa11dcbd	Support variable fulltext searches. (#479 ) r=nalexander	2017-06-15 10:32:46 -07:00
Richard Newman	3f264e9eb2	Implement `fulltext`. (#477 ) r=nalexander * You can't use fulltext search on a non-fulltext attribute. * Allow for implicit placeholder bindings in fulltext.	2017-06-15 10:28:11 -07:00
Richard Newman	565a0e9ff9	Implement MATCHES throughout SQL machinery.	2017-06-15 10:28:10 -07:00
Richard Newman	17c59bbff6	Apply newly bound values to existing columns. This commit lifts some logic out of the scalar ground handler to apply elsewhere. When a new value binding is encountered for a variable to which column bindings have already been established, we do two things: - We apply a new constraint to the primary column. This ensures that the behavior for ground-first and ground-second is equivalent. - We eliminate any existing column type extraction: it won't be necessary now that a constant value and constant type are known.	2017-06-15 10:28:09 -07:00
Richard Newman	f7a3fd5b17	Refactor arg conversion and ground into separate files.	2017-06-15 10:28:07 -07:00
Richard Newman	54bdd382fb	Add a test that late inputs aren't allowed in ground.	2017-06-15 10:28:05 -07:00
Richard Newman	e1e549440f	Expand type code when applying ground. (#475 )	2017-06-09 20:18:53 -07:00
Nick Alexander	79fa0994b3	Part 3: Handle `ground`. (#469 ) r=nalexander,rnewman This version removes nalexander's lovely matrix code. It turned out that scalar and tuple bindings are sufficiently different from coll and rel -- they can directly apply as values in the query -- that there was no point in jumping through hoops to turn those single values into a matrix. Furthermore, I've standardized us on a Vec<TypedValue> representation for rectangular matrices, which should be much more efficient, but would have required rewriting that code. Finally, coll and rel are sufficiently different from each other -- coll doesn't require processing nested collections -- that my attempts to share code between them fell somewhat flat. I had lots of nice ideas about zipping together cycles and such, but ultimately I ended up with relatively straightforward, if a bit repetitive, code. The next commit will demonstrate the value of this work -- tests that exercised scalar and tuple grounding now collapse down to the simplest possible SQL.	2017-06-09 20:18:31 -07:00
Richard Newman	c6e933c396	Pre: make rule_vars return unique vars.	2017-06-09 20:16:39 -07:00
Richard Newman	9ac2b8c680	Pre: add ConjoiningClauses::known_type_set.	2017-06-09 20:16:38 -07:00
Richard Newman	899e5d0971	Pre: add ConjoiningClauses::bind_value.	2017-06-09 20:16:38 -07:00
Nick Alexander	13e27c83e2	Pre: Modify predicate implementation in preparation for functions that bind.	2017-06-09 20:16:38 -07:00
Nick Alexander	4d2eb7222e	Pre: Generalize NonNumericArgument to InvalidArgument.	2017-06-09 20:16:37 -07:00
Nick Alexander	002c918c96	Pre: Move PushComputed up module hierarchy; make it public.	2017-06-09 20:16:37 -07:00
Nick Alexander	9fe31d443d	Pre: Accept EDN vectors in FnArg arguments. Datomic accepts mostly-arbitrary EDN, and it is actually used: for example, the following are all valid, and all mean different things: * `(ground 1 ?x)` * `(ground [1 2 3] [?x ?y ?z])` * `(ground [[1 2 3] [4 5 6]] [[?x ?y ?z]])` We could probably introduce new syntax that expresses these patterns while avoiding collection arguments, but I don't see one right now. I've elected to support only vectors for simplicity; I'm hoping to avoid parsing edn::Value in the query-algebrizer.	2017-06-09 20:16:36 -07:00
Nick Alexander	08534a1a3a	Pre: Handle SrcVar.	2017-06-09 20:16:36 -07:00
Richard Newman	daca8def57	UUIDs and instants. Fixes #44 , #45 , #426 , #427 . (#438 ) r=nalexander * Pre: unused import in translate.rs. * Part 2: take a dependency on rusqlite for query arguments. * Part 1: flatten V2 schema into V1. Add UUID and URI. Bump expected ident and bootstrap datom count in tests. * Part 5: parse edn::Value::Uuid. * Part 3: extend ValueType and TypedValue to include Uuid. * Part 4: add Uuid to query arguments. * Part 6: extend db to support Uuid. * Part 8: add a tx-parser test for #f NaN and #uuid. * Part 7: parse and algebrize UUIDs in queries. * Part 1: parse #inst in EDN and throughout query engine. * Part 3: handle instants in db. * Part 2: instants never match integers in queries. * Part 4: use DateTime for tx_instants. * Add a test for adding and querying UUIDs and instants. * Review comments.	2017-04-28 20:11:55 -07:00
Emily Toop	bd389d2f0d	Parse and Algebrize `not` & `not-join`. (#302 ) (Closes #303 , #389 , #422 ) r=rnewman * Part 1 - Parse `not` and `not-join` * Part 2 - Validate `not` and `not-join` pre-algebrization * Address review comments rnewman. * Remove `WhereNotClause` and populate `NotJoin` with `WhereClause`. * Fix validation for `not` and `not-join`, removing tests that were invalid. * Address rustification comments. * Rebase against `rust` branch. * Part 3 - Add required types for NotJoin. * Implement `PartialEq` for `ConjoiningClauses` so `ComputedTable` can be included inside `ColumnConstraint::NotExists` * Part 4 - Implement `apply_not_join` * Part 5 - Call `apply_not_join` from inside `apply_clause` * Part 6 - Translate `not-join` into `NOT EXISTS` SQL * Address review comments. * Rename `projected` to `unified` to better describe the fact that we are not projecting any variables. * Check for presence of each unified var in either `column_bindings` or `input_bindings` and bail if not there. * Copy over `input_bindings` for each var in `unified`. * Only copy over the first `column_binding` for each variable in `unified` rather than the whole list. * Update tests. * Address review comments. * Make output from Debug for NotExists more useful * Clear up misunderstanding. Any single failing clause in the not will cause the entire not to be considered empty * Address review comments. * Remove Limit requirement from cc_to_exists. * Use Entry.or_insert instead of matching on the entry to add to column_bindings. * Move addition of value_bindings to before apply_clauses on template. * Tidy up tests with some variable reuse. * Addressed nits, * Address review comments. * Move addition of column_bindings to above apply_clause. * Update tests. * Add test to ensure that unbound vars fail * Improve test for unbound variable to check for correct variable and error * address nits	2017-04-28 10:44:11 +01:00
Richard Newman	19fc7cddf1	[query] Widen `known_types` correctly in complex `or`. (#424 ) r=nalexander * Part 1: define ValueTypeSet. We're going to use this instead of `HashSet<ValueType>` so that we can clearly express the empty set and the set of all types, and also to encapsulate a switch to `EnumSet`. * Part 2: use ValueTypeSet. * Part 3: fix type expansion. * Part 4: add a test for type extraction from nested `or`. * Review comments. * Review comments: simplify ValueTypeSet.	2017-04-24 14:15:26 -07:00
Richard Newman	bc63744aba	Add :limit to queries (#420 ) r=nalexander * Pre: put query parts in alphabetical order. * Pre: rename 'input' to 'query' in translate tests. * Part 1: parse :limit. * Part 2: validate and escape variable parameters in SQL. * Part 3: algebrize and translate limits.	2017-04-19 16:16:19 -07:00
Richard Newman	bffefe7e6b	Review comments for #418 .	2017-04-18 13:50:58 -07:00
Richard Newman	60c082b61e	Part 4: pass inputs through algebrizing and execution. (#418 ) This also adds a test that an `UnboundVariables` error is raised if a variable mentioned in the `:in` clause isn't bound.	2017-04-18 13:19:50 -07:00
Richard Newman	dfc846e483	Part 3: define keep_intersected_keys. We'll use this to drop unneeded values from input maps, if lazy callers reuse a general-purpose map for multiple queries.	2017-04-18 13:19:50 -07:00
Richard Newman	651308f721	Part 2: define a type to encapsulate query inputs. This is for two reasons. Firstly, we need to track the types of inputs, their values, and also the input variables; adding a struct gives us a little more clarity. Secondly, when we come to implement prepared statements, we'll be algebrizing queries without having the values available. We'll be able to do a better job of algebrizing, and also do more validating, if we allow callers to specify the types of variables in advance, even if the values aren't known.	2017-04-18 13:19:50 -07:00
Richard Newman	35d73d5541	Implement :order. (#415 ) (#416 ) r=nalexander This adds an `:order` keyword to `:find`. If present, the results of the query will be an ordered set, rather than an unordered set; rows will appear in an order defined by each `:order` entry. Each can be one of three things: - A var, `?x`, meaning "order by ?x ascending". - A pair, `(asc ?x)`, meaning "order by ?x ascending". - A pair, `(desc ?x)`, meaning "order by ?x descending". Values will be ordered in this sequence for asc, and in reverse for desc: 1. Entity IDs, in ascending numerical order. 2. Booleans, false then true. 3. Timestamps, in ascending numerical order. 4. Longs and doubles, intermixed, in ascending numerical order. 5. Strings, in ascending lexicographic order. 6. Keywords, in ascending lexicographic order, considering the entire ns/name pair as a single string separated by '/'. Subcommits: Pre: make bound_value public. Pre: generalize ErrorKind::UnboundVariable for use in order. Part 1: parse (direction, var) pairs. Part 2: parse :order clause into FindQuery. Part 3: include order variables in algebrized query. We add order variables to :with, so we can reuse its type tag projection logic, and so that we can phrase ordering in terms of variables rather than datoms columns. Part 4: produce SQL for order clauses.	2017-04-17 11:30:31 -07:00
Richard Newman	758ab8b476	Part 5: add more tests for complex `or`.	2017-04-12 19:21:56 -07:00
Richard Newman	d8075aa07d	Part 3: finish expansion and translation of complex `or`. This commit turns complex `or` -- `or`s in which not all variables are unified, or in which not all arms are the same shape -- into a computed table. We do this by building a template CC that shares some state with the destination CC, applying each arm of the `or` to a copy of the template as if it were a standalone query, then building a projection list and creating a `ComputedTable::Union`. This is pushed into the destination CC's `computed_tables` list. Finally, the variables projected from the UNION are bound in the destination CC, so that unification occurs, and projection of the outermost query can use bindings established by the `or-join`. This commit includes projection of type codes from heterogeneous `UNION` arms: we compute a list of variables for which a definite type is unknown in at least one arm, and force all arms to project either a type tag column or a fixed type. It's important that each branch of a UNION project the same columns in the same order, hence the projection of fixed values. The translator is similarly extended to project the type tag column name or the known value_type_tag to support this. Review comment: clarify union type extraction.	2017-04-12 19:21:45 -07:00
Richard Newman	08d2c613a4	Part 2: expand the definition of a table to include computed tables. This commit: - Defines a new kind of column, distinct from the eavt columns in `DatomsColumn`, to model the rows projected from subqueries. These always name one of two things: a variable, or a variable's type tag. Naturally the two cases are thus `Variable` and `VariableTypeTag`. These are cheap to clone, given that `Variable` is an `Rc<String>`. - Defines `Column` as a wrapper around `DatomsColumn` and `VariableColumn`. Everywhere we used to use `DatomsColumn` we now allow `Column`: particularly in constraints and projections. - Broadens the definition of a table list in the intermediate "query-sql" representation to include a SQL UNION. A UNION is represented as a list of queries and an alias. - Implements translation from a `ComputedTable` to the query-sql representation. In this commit we only project vars, not type tags. Review comment: discuss bind_column_to_var for ValueTypeTag. Review comment: implement From<Vec<T>> for ConsumableVec<T>.	2017-04-12 19:21:33 -07:00
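
The `Column` wrapper described in 08d2c613a4 can be pictured roughly as follows. This is a minimal sketch, not the code from the commit. Only the names `DatomsColumn`, `VariableColumn`, `Column`, `Variable`, and `VariableTypeTag` come from the commit message; the `DatomsColumn` variants, the `Fixed`/`Variable` variant names on `Column`, and the `column_name` helper with its naming scheme are assumptions made for illustration.

```rust
use std::rc::Rc;

/// Columns of the datoms table. The exact variant set is an assumption.
#[derive(Clone, Debug, PartialEq, Eq)]
enum DatomsColumn {
    Entity,
    Attribute,
    Value,
    Tx,
}

/// A query variable such as `?x`. Cheap to clone because it wraps an Rc.
#[derive(Clone, Debug, PartialEq, Eq)]
struct Variable(Rc<String>);

/// A column projected from a subquery: either the variable's value
/// or the variable's type tag.
#[derive(Clone, Debug, PartialEq, Eq)]
enum VariableColumn {
    Variable(Variable),
    VariableTypeTag(Variable),
}

/// Wrapper accepted wherever only `DatomsColumn` used to be allowed.
/// The variant names here are hypothetical.
#[derive(Clone, Debug, PartialEq, Eq)]
enum Column {
    Fixed(DatomsColumn),
    Variable(VariableColumn),
}

impl From<DatomsColumn> for Column {
    fn from(c: DatomsColumn) -> Column {
        Column::Fixed(c)
    }
}

impl From<VariableColumn> for Column {
    fn from(c: VariableColumn) -> Column {
        Column::Variable(c)
    }
}

/// Hypothetical helper: render a column as a SQL column name.
/// The naming scheme is illustrative only.
fn column_name(column: &Column) -> String {
    match column {
        Column::Fixed(DatomsColumn::Entity) => "e".to_string(),
        Column::Fixed(DatomsColumn::Attribute) => "a".to_string(),
        Column::Fixed(DatomsColumn::Value) => "v".to_string(),
        Column::Fixed(DatomsColumn::Tx) => "tx".to_string(),
        Column::Variable(VariableColumn::Variable(v)) => {
            v.0.trim_start_matches('?').to_string()
        }
        Column::Variable(VariableColumn::VariableTypeTag(v)) => {
            format!("{}_value_type_tag", v.0.trim_start_matches('?'))
        }
    }
}
```

With the `From` impls, call sites that previously passed a `DatomsColumn` can keep doing so via `.into()`, which is one plausible way to widen constraints and projections without touching every caller.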


66 commits
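
The cross-type ordering listed in the `:order` commit (35d73d5541) can be illustrated with an in-memory comparator. This is only a sketch of the stated sequence: per the commit message, Mentat itself produces SQL for order clauses rather than sorting in Rust, and the `Value` enum, `rank`, `compare`, `sort_values`, and `Direction` names below are all hypothetical.

```rust
use std::cmp::Ordering;

/// Hypothetical value type covering the six groups in the commit message.
#[derive(Clone, Debug, PartialEq)]
enum Value {
    Ref(i64),                // entity ID
    Boolean(bool),
    Instant(i64),            // timestamp as a number
    Long(i64),
    Double(f64),
    Str(String),
    Keyword(String, String), // (namespace, name)
}

/// Position of each type group in the ascending sequence.
/// Longs and doubles share a rank because they are intermixed.
fn rank(v: &Value) -> u8 {
    match v {
        Value::Ref(_) => 0,
        Value::Boolean(_) => 1,
        Value::Instant(_) => 2,
        Value::Long(_) | Value::Double(_) => 3,
        Value::Str(_) => 4,
        Value::Keyword(_, _) => 5,
    }
}

/// Ascending comparison following the sequence in the commit message.
/// Mixed long/double pairs are compared as f64, which loses precision
/// for very large longs; acceptable for a sketch.
fn compare(a: &Value, b: &Value) -> Ordering {
    match (a, b) {
        (Value::Ref(x), Value::Ref(y)) => x.cmp(y),
        (Value::Boolean(x), Value::Boolean(y)) => x.cmp(y), // false < true
        (Value::Instant(x), Value::Instant(y)) => x.cmp(y),
        (Value::Long(x), Value::Long(y)) => x.cmp(y),
        (Value::Long(x), Value::Double(y)) => (*x as f64).total_cmp(y),
        (Value::Double(x), Value::Long(y)) => x.total_cmp(&(*y as f64)),
        (Value::Double(x), Value::Double(y)) => x.total_cmp(y),
        (Value::Str(x), Value::Str(y)) => x.cmp(y),
        (Value::Keyword(ns1, n1), Value::Keyword(ns2, n2)) => {
            // Treat the whole ns/name pair as one string separated by '/'.
            format!("{}/{}", ns1, n1).cmp(&format!("{}/{}", ns2, n2))
        }
        // Different type groups: order by group rank.
        _ => rank(a).cmp(&rank(b)),
    }
}

enum Direction {
    Asc,
    Desc,
}

/// Sort in place; `Desc` is the reverse of the ascending sequence.
fn sort_values(values: &mut [Value], direction: Direction) {
    match direction {
        Direction::Asc => values.sort_by(|a, b| compare(a, b)),
        Direction::Desc => values.sort_by(|a, b| compare(b, a)),
    }
}
```

A bare var `?x` and the pair `(asc ?x)` would both map to `Direction::Asc` in this sketch, and `(desc ?x)` to `Direction::Desc`.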