mentat

Author	SHA1	Message	Date
Nick Alexander	09f1d633b5	Part 4: Parse queries with `rust-peg`. There's an unfortunate conflation here between implementing the query parser in `rust-peg` and moving some validation that now happens at parse time to happen later. The result is that we introduce `ParsedFindQuery` as a less-processed `FindQuery`, and that we only use string errors (which is all `rust-peg` supports) instead of the structured errors in query-parser's errors module. The next commit will address this, on the road to removing the `query-parser` module entirely.	2018-06-04 15:04:39 -07:00
Nick Alexander	1d8d94f887	Part 2: Turn `(type-function ?var)` into `(type ?var type-keyword)`. This is more general (the parser doesn't encode the set of known types), and avoids a dependency on `ValueType`.	2018-06-04 14:52:51 -07:00
Nick Alexander	f1fc9f1846	Part 0: Extract query-parser errors.	2018-06-04 14:52:51 -07:00
Richard Newman	3cba87c74b	Allow pull aliases to be non-namespaced. (#694 ) r=nalexander	2018-05-14 10:45:48 -07:00
Richard Newman	60cb5d2432	Pull improvements (#682 ) r=nalexander * Parse and handle aliased pull attributes. * Allow :db/id to be mentioned as a pull attribute. * Clean up comment. * Remove unused function.	2018-05-13 14:15:36 -07:00
Richard Newman	3dc68bcd38	Combine NamespacedKeyword and Keyword. (#689 ) r=nalexander * Make properties on NamespacedKeyword/NamespacedSymbol private * Use only a single String for NamespacedKeyword/NamespacedSymbol * Review comments. * Remove unsafe code in namespaced_name. Benchmarking shows approximately zero change. * Allow the types of ns and name to differ when constructing a NamespacedName. * Make symbol namespaces optional. * Normalize names of keyword/symbol constructors. This will make the subsequent refactor much less painful. * Use expect not unwrap. * Merge Keyword and NamespacedKeyword.	2018-05-11 09:52:17 -07:00
Richard Newman	e21156a754	Implement simple pull expressions (#638 ) r=nalexander * Refactor AttributeCache populator code for use from pull. * Pre: add to_value_rc to Cloned. * Pre: add From<StructuredMap> for Binding. * Pre: clarify Store::open_empty. * Pre: StructuredMap cleanup. * Pre: clean up a doc test. * Split projector crate. Pass schema to projector. * CLI support for printing bindings. * Add and use ConjoiningClauses::derive_types_from_find_spec. * Define pull types. * Implement pull on top of the attribute cache layer. * Add pull support to the projector. * Parse pull expressions. * Add simple pull support to connection objects. * Tests for pull. * Compile with Rust 1.25. The only choice involved in this commit is that of replacing the anonymous lifetime '_ with a named lifetime for the cache; since we're accepting a Known, which includes the cache in question, I think it's clear that we expect the function to apply to any given cache lifetime. * Review comments. * Bail on unnamed attribute. * Make assert_parse_failure_contains safe to use. * Rework query parser to report better errors for pull. * Test for mixed wildcard and simple attribute.	2018-05-04 12:56:00 -07:00
Richard Newman	f979044ba1	Refactor value type boxing. (#659 ) r=nalexander * Pre: eliminate some occurrences of Rc, largely through the magic of Into. * Pre: introduce FromRc to convert between refcounted types. * Introduce ValueRc as an abstraction over Rc/Arc choice. * Move Cloned to core. * Move CString-creation methods to TypedValue. * Finish transition.	2018-04-25 14:23:27 -07:00
Richard Newman	833ff92436	Simple aggregates. (#584 ) r=emily * Pre: use debugcli in VSCode. * Pre: wrap subqueries in parentheses in output SQL. * Pre: add ExistingColumn. This lets us make reference to columns by name, rather than only pointing to qualified aliases. * Pre: add Into for &str to TypedValue. * Pre: add Store.transact. * Pre: cleanup. * Parse and algebrize simple aggregates. (#312) * Follow-up: print aggregate columns more neatly in the CLI. * Useful ValueTypeSet helpers. * Allow for entity inequalities. * Add 'differ', which is a ref-specialized not-equals. * Add 'unpermute', a function for getting unique, distinct pairs from bindings. * Review comments. * Add 'the' pseudo-aggregation operator. This allows for a corresponding value to be returned when a query includes one 'min' or 'max' aggregate.	2018-03-12 15:18:50 -07:00
Thom Chiovoloni	98502eb68f	Implement type annotations in queries. (#526 ) r=rnewman	2018-01-29 14:37:53 -08:00
Thom	9740cafdbd	Automatically remove trailing whitespace from text files. (#527 ) r=rnewman This was done using the following shell script: ``` find . -type f -not -path "target" \ '(' -name '.rs' -o -name '.md' -o -name '.toml' ')' -print0 | \ xargs -0 sed -i '' -E 's/[[:space:]]$//' ``` Which is admittedly imperfect, but manages to hit everything that was a problem in this repo.	2018-01-19 21:21:04 -06:00
Richard Newman	df90c366af	Partial work from simple aggregates work (#497 ) r=nalexander * Pre: make FindQuery, FindSpec, and Element non-Clone. * Pre: make query translator return a Result. * Pre: make projection return a Result. * Pre: refactor query parser in preparation for parsing aggregates. * Pre: rename PredicateFn -> QueryFunction. * Pre: expose more about bound variables from CC. * Pre: move ValueTypeSet to core.	2017-11-30 15:02:07 -08:00
Richard Newman	c600152d78	Update some dependencies. (#492 ) r=etoop * Update some dependencies. * Update rusqlite to 0.12. * Update error-chain to a forked version that implements Sync. * Fix some compiler warnings. * Remove unused imports in tests. * Parse errors no longer naturally print with the expected symbol.	2017-11-21 16:24:08 +00:00
Nick Alexander	b9cbf92205	Part 1: Parse functions in where clauses.	2017-06-09 20:16:39 -07:00
Richard Newman	c6e933c396	Pre: make rule_vars return unique vars.	2017-06-09 20:16:39 -07:00
Richard Newman	d30ad428e8	Pre: take a dependency on maplit to allow BTreeSet literals.	2017-06-09 20:16:39 -07:00
Nick Alexander	9fe31d443d	Pre: Accept EDN vectors in FnArg arguments. Datomic accepts mostly-arbitrary EDN, and it is actually used: for example, the following are all valid, and all mean different things: * `(ground 1 ?x)` * `(ground [1 2 3] [?x ?y ?z])` * `(ground [[1 2 3] [4 5 6]] [[?x ?y ?z]])` We could probably introduce new syntax that expresses these patterns while avoiding collection arguments, but I don't see one right now. I've elected to support only vectors for simplicity; I'm hoping to avoid parsing edn::Value in the query-algebrizer.	2017-06-09 20:16:36 -07:00
Nick Alexander	d1ac752de6	Parse without copying; parse keyword maps using macros. This is a big commit, but it breaks into two conceptual pieces. The first is to "parse without copying". We replace a stream of an owned collection of edn::ValueAndSpan and instead have a stream of a borrowed collection of &edn::ValueAndSpan references. (Generally, this is represented as an iterator over a slice, but it can be over other things too.) Cloning such iterators is constant time, which improves on cloning an owned collection of edn::ValueAndSpan, which is linear time in the length of the collection and additional time depending on the complexity of the EDN values. The second conceptual piece is to parse keyword maps using a special parser and a macro to build the parser implementations. Before, we created a new edn::ValueAndSpan::Map to represent a keyword map in vector form; since we're working with &edn::ValueAndSpan references now, we can't create an &edn::ValueAndSpan reference with an appropriate lifetime. Therefore we generalize the concept of iteration slightly and turn keyword maps in map form into linear iterators by flattening the value maps. This is a potentially obscuring transformation, so we have to take care to protect against some failure cases. (See the comments and the tests in the code.) After these changes, parsing using `combine` is linear time (and reasonably fast).	2017-05-18 10:17:13 -07:00
Richard Newman	daca8def57	UUIDs and instants. Fixes #44 , #45 , #426 , #427 . (#438 ) r=nalexander * Pre: unused import in translate.rs. * Part 2: take a dependency on rusqlite for query arguments. * Part 1: flatten V2 schema into V1. Add UUID and URI. Bump expected ident and bootstrap datom count in tests. * Part 5: parse edn::Value::Uuid. * Part 3: extend ValueType and TypedValue to include Uuid. * Part 4: add Uuid to query arguments. * Part 6: extend db to support Uuid. * Part 8: add a tx-parser test for #f NaN and #uuid. * Part 7: parse and algebrize UUIDs in queries. * Part 1: parse #inst in EDN and throughout query engine. * Part 3: handle instants in db. * Part 2: instants never matches integers in queries. * Part 4: use DateTime for tx_instants. * Add a test for adding and querying UUIDs and instants. * Review comments.	2017-04-28 20:11:55 -07:00
Emily Toop	bd389d2f0d	Parse and Algebrize `not` & `not-join`. (#302 ) (Closes #303 , #389 , #422 ) r=rnewman * Part 1 - Parse `not` and `not-join` * Part 2 - Validate `not` and `not-join` pre-algebrization * Address review comments rnewman. * Remove `WhereNotClause` and populate `NotJoin` with `WhereClause`. * Fix validation for `not` and `not-join`, removing tests that were invalid. * Address rustification comments. * Rebase against `rust` branch. * Part 3 - Add required types for NotJoin. * Implement `PartialEq` for `ConjoiningClauses` so `ComputedTable` can be included inside `ColumnConstraint::NotExists` * Part 4 - Implement `apply_not_join` * Part 5 - Call `apply_not_join` from inside `apply_clause` * Part 6 - Translate `not-join` into `NOT EXISTS` SQL * Address review comments. * Rename `projected` to `unified` to better describe the fact that we are not projecting any variables. * Check for presence of each unified var in either `column_bindings` or `input_bindings` and bail if not there. * Copy over `input_bindings` for each var in `unified`. * Only copy over the first `column_binding` for each variable in `unified` rather than the whole list. * Update tests. * Address review comments. * Make output from Debug for NotExists more useful * Clear up misunderstanding. Any single failing clause in the not will cause the entire not to be considered empty * Address review comments. * Remove Limit requirement from cc_to_exists. * Use Entry.or_insert instead of matching on the entry to add to column_bindings. * Move addition of value_bindings to before apply_clauses on template. * Tidy up tests with some variable reuse. * Addressed nits, * Address review comments. * Move addition of column_bindings to above apply_clause. * Update tests. * Add test to ensure that unbound vars fail * Improve test for unbound variable to check for correct variable and error * address nits	2017-04-28 10:44:11 +01:00
Richard Newman	bc63744aba	Add :limit to queries (#420 ) r=nalexander * Pre: put query parts in alphabetical order. * Pre: rename 'input' to 'query' in translate tests. * Part 1: parse :limit. * Part 2: validate and escape variable parameters in SQL. * Part 3: algebrize and translate limits.	2017-04-19 16:16:19 -07:00
Nick Alexander	ff0147e89c	Review comments: downgrade to error-chain 0.8.1 for Send + Sync bound; use combine::primitive::Error.	2017-04-18 13:19:50 -07:00
Richard Newman	a9a82ea1a7	Part 1: parse :in. We also at this point switch from using `Vec<Variable>` to `BTreeSet<Variable>`. This allows us to guarantee no duplicates later; we'll reject duplicates at parse time.	2017-04-18 13:19:50 -07:00
Richard Newman	35d73d5541	Implement :order. (#415 ) (#416 ) r=nalexander This adds an `:order` keyword to `:find`. If present, the results of the query will be an ordered set, rather than an unordered set; rows will appear in an ordered defined by each `:order` entry. Each can be one of three things: - A var, `?x`, meaning "order by ?x ascending". - A pair, `(asc ?x)`, meaning "order by ?x ascending". - A pair, `(desc ?x)`, meaning "order by ?x descending". Values will be ordered in this sequence for asc, and in reverse for desc: 1. Entity IDs, in ascending numerical order. 2. Booleans, false then true. 3. Timestamps, in ascending numerical order. 4. Longs and doubles, intermixed, in ascending numerical order. 5. Strings, in ascending lexicographic order. 6. Keywords, in ascending lexicographic order, considering the entire ns/name pair as a single string separated by '/'. Subcommits: Pre: make bound_value public. Pre: generalize ErrorKind::UnboundVariable for use in order. Part 1: parse (direction, var) pairs. Part 2: parse :order clause into FindQuery. Part 3: include order variables in algebrized query. We add order variables to :with, so we can reuse its type tag projection logic, and so that we can phrase ordering in terms of variables rather than datoms columns. Part 4: produce SQL for order clauses.	2017-04-17 11:30:31 -07:00
Richard Newman	0639c94468	Part 2: implement simple `or`.	2017-04-07 12:46:25 -07:00
Nick Alexander	5369f03464	Improve parsing of nested `edn::ValueAndSpan` streams. r=rnewman (#393 ) * Pre: Expose more in edn. * Pre: Make it easier to work with ValueAndSpan. with_spans() is a temporary hack, needed only because I don't care to parse the bootstrap assertions from text right now. * Part 1a: Add `value_and_span` for parsing nested `edn::ValueAndSpan` instances. I wasn't able to abstract over `edn::Value` and `edn::ValueAndSpan`; there are multiple obstacles. I chose to roll with `edn::ValueAndSpan` since it exposes the additional span information that we will want to form good error messages in the future. * Part 1b: Add keyword_map() parsing an `edn::Value::Vector` into an `edn::Value::map`. * Part 1c: Add `Log`/`.log(...)` for logging parser progress. This is a terrible hack, but it sure helps to debug complicated nested parsers. I don't even know what a principled approach would look like; since our parser combinators are so frequently expressed in code, it's hard to imagine a data-driven interpreter that can help debug things. * Part 2: Use `value_and_span` apparatus in tx-parser/. I break an abstraction boundary by returning a value column `edn::ValueAndSpan` rather than just an `edn::Value`. That is, the transaction processor shouldn't care where the `edn::Value` it is processing arose -- even we care to track that information we should bake it into the `Entity` type. We do this because we need to dynamically parse the value column to support nested maps, and parsing requires a full `edn::ValueAndSpan`. Alternately, we could cheat and fake the spans when parsing nested maps, but that's potentially expensive. * Part 3: Use `value_and_span` apparatus in query-parser/. * Part 4: Use `value_and_span` apparatus in root crate. * Review comment: Make Span and SpanPosition Copy. * Review comment: nits. * Review comment: Make `or` be `or_exactly`. I baked the eof checking directly into the parser, rather than using the skip and eof parsers. I also took the time to restore some tests that were mistakenly commented out. * Review comment: Extract and use def_matches_* macros. * Review comment: .map() as late as possible.	2017-04-06 10:06:28 -07:00
Richard Newman	a5023c70cb	Use Rc for TypedValue, Variable, and query Ident keywords. (#395 ) r=nalexander Part 1, core: use Rc for String and Keyword. Part 2, query: use Rc for Variable. Part 3, sql: use Rc for args in SQLiteQueryBuilder. Part 4, query-algebrizer: use Rc. Part 5, db: use Rc. Part 6, query-parser: use Rc. Part 7, query-projector: use Rc. Part 8, query-translator: use Rc. Part 9, top level: use Rc. Part 10: intern Ident and IdentOrKeyword.	2017-04-02 21:38:36 -07:00
Richard Newman	0d15381e11	Crudely parse `or` and `or-join`. (#388 ) r=nalexander	2017-03-27 16:32:01 -07:00
Richard Newman	d83c8620cd	Implement parsing of query predicates. (#380 ) r=nalexander	2017-03-22 10:19:44 -07:00
Richard Newman	11a9a30d35	Pre: reformat query parser code.	2017-03-22 10:14:05 -07:00
Richard Newman	48312e1ff0	Rebased conversion of mentat_query_parser to use error-chain. r=nalexander This is a tiny bit simpler and more consistent.	2017-02-27 16:16:54 -08:00
Nick Alexander	dcd9bcb1ce	Extract partial storage abstraction; use error-chain throughout. Fixes #328 . r=rnewman (#341 ) * Pre: Drop unneeded tx0 from search results. * Pre: Don't require a schema in some of the DB code. The idea is to separate the transaction applying code, which is schema-aware, from the concrete storage code, which is just concerned with getting bits onto disk. * Pre: Only reference Schema, not DB, in debug module. This is part of a larger separation of the volatile PartitionMap, which is modified every transaction, from the stable Schema, which is infrequently modified. * Pre: Fix indentation. * Extract part of DB to new SchemaTypeChecking trait. * Extract part of DB to new PartitionMapping trait. * Pre: Don't expect :db.part/tx partition to advance when tx fails. This fails right now, because we allocate tx IDs even when we shouldn't. * Sketch a db interface without DB. * Add ValueParseError; use error-chain in tx-parser. This can be simplified when https://github.com/Marwes/combine/issues/86 makes it to a published release, but this unblocks us for now. This converts the `combine` error type `ParseError<&'a [edn::Value]>` to a type with owned `Vec<edn::Value>` collections, re-using `edn::Value::Vector` for making them `Display`. * Pre: Accept Borrow<Schema> instead of just &Schema in debug module. This makes it easy to use Rc<Schema> or Arc<Schema> without inserting &* sigils throughout the code. * Use error-chain in query-parser. There are a few things to point out here: - the fine grained error types have been flattened into one crate-wide error type; it's pretty easy to regain the granularity as needed. - edn::ParseError is automatically lifted to mentat_query_parser::errors::Error; - we use mentat_parser_utils::ValueParser to maintain parsing error information from `combine`. * Patch up top-level. * Review comment: Only `borrow()` once.	2017-02-24 15:33:48 -08:00
Joe Walker	40bca2df6d	Remove most uses of `use foo::*`	2017-02-23 14:09:54 +00:00
Richard Newman	a10f68fdb7	Mark every project as being part of the workspace. r=nalexander This allows `cargo test --all` to work.	2017-02-20 11:04:08 -08:00
Victor Porof	896d7f8f88	Add a span component to edn::Value, r=ncalexan Signed-off-by: Victor Porof <victor.porof@gmail.com>	2017-02-17 18:31:26 +01:00
Richard Newman	2e303f4837	Stub out mentat::q_once. (#289 ) r=nalexander * Leave a pointer to issue 288. * Re-export mentat_db::types::DB from mentat_db. * Parse EDN strings in the query parser. * Export 'public' API from mentat_query_parser's top level. * Stub out mentat::q_once.	2017-02-13 10:30:02 -08:00
Joe Walker	f591c90738	Use mentat-parser-utils in tx-parser. Fixes #235 ; r=rnewman,victorporof Move macros query-parser/…/parser_utils.rs → parser-utils/…/query.rs Signed-off-by: Joe Walker <jwalker@mozilla.com>	2017-02-10 18:30:03 +00:00
Victor Porof	c585715224	Don't depend on num and ordered-float in the db and query crates, r=ncalexan (#223 ) Signed-off-by: Victor Porof <victor.porof@gmail.com>	2017-02-08 12:19:16 +01:00
Richard Newman	5b770a54cd	Parse basic :find and :where clauses. (#211 ) r=nalexander * Make Variable::from_symbol public. * Implement basic parsing of queries. * Use pinned dependencies the hard way to fix Travis. * Bump ordered-float dependency to 0.4.0. * Error coercions to use ?, and finishing the find interface.	2017-02-02 18:32:00 -08:00
Richard Newman	5d74f1ee94	Add utilities for defining parsers. (#218 ) r=vporof satisfy_unwrap and ResultParser go into mentat_parser_utils.	2017-02-02 10:25:05 -08:00
Richard Newman	fcdf759399	Rename parser_utils to mentat_parser_utils, clean up imports. (#234 ) r=vporof	2017-02-02 08:18:04 -08:00
Victor Porof	ba1896b684	Extract assert_parses_to into a parser utility crate, r=rnewman. Fixes #200 Signed-off-by: Victor Porof <vporof@mozilla.com>	2017-02-02 10:17:08 +01:00
Richard Newman	592dec7241	Implement a FromValue trait for SrcVar and Variable. (#227 ) r=nalexander	2017-02-01 15:05:14 -08:00
Richard Newman	f1a55c9f12	Move query-parser test functions into a submodule.	2017-02-01 10:44:53 -08:00
Richard Newman	c6fa14c0c8	Rudimentary printing of EDN values. (#209 ) r=jsantell * Add a little From helper for edn::parse::ParseError. Not used yet. * Ignore more things. * Partly implement Display for edn::Value.	2017-01-28 14:18:17 -08:00
Richard Newman	2592506288	Implement parsing of simple :find expressions. (#196 ) r=nalexander * Test the mentat_query directory on Travis. * Export common types from edn. This allows you to write use edn::{PlainSymbol,Keyword}; instead of use edn::symbols::{PlainSymbol,Keyword}; * Add an edn::Value::is_keyword predicate. * Clean up query, preparing for query-parser. * Make EDN keywords and symbols take Into<String> arguments. * Implement parsing of simple :find lists. * Rustfmt query-parser. Split find and query. * Review comment: values_to_variables now returns a NotAVariableError on failure. * Review comment: rename gimme to to_parsed_value. * Review comment: add comments.	2017-01-25 14:06:19 -08:00
Richard Newman	476f04e27b	Implement a rudimentary Keyword struct and the beginnings of ident/entid.	2017-01-09 12:31:56 -08:00
Richard Newman	b9c439bd00	Use underscores for crate names.	2017-01-06 17:31:26 -08:00
Richard Newman	a665926fe6	Rename to Project Mentat (query-parser).	2017-01-06 17:20:21 -08:00
Brian Grinstead	9b8257a725	Create a new crate for the query parser. Fixes #138 . r=rnewman Starting to work out the project layout for sub-crates. The crate inside query-parser/ is "datomish-query-parser" and the core code in src/ depends on it.	2016-12-16 18:43:47 -08:00

50 commits