sqlite-parquet-vtable

mirror of https://github.com/cldellow/sqlite-parquet-vtable.git synced 2025-07-26 18:53:30 +00:00

Author	SHA1	Message	Date
Colin Dellow	d3ab5ff3e7	Cache clauses -> row group mapping Create a shadow table. For `stats`, it'd be `_stats_rowgroups`. It contains three columns: - the clause (eg `city = 'Dawson Creek'`) - the initial estimate, as a bitmap of rowgroups based on stats - the actual observed rowgroups, as a bitmap This papers over poorly sorted parquet files, at the cost of some disk space. It makes interactive queries much more natural -- drilldown style queries are much faster, as they can leverage work done by previous queries. eg 'SELECT * FROM stats WHERE city = 'Dawson Creek' and question_id >= 1935 and question_id <= 1940` takes ~584ms on first run, but 9ms on subsequent runs. We only create entries when the estimates don't match the actual results. Fixes #6	2018-03-24 23:57:15 -04:00
Colin Dellow	b0c7b229dd	Create queries from templates if needed	2018-03-18 17:50:39 -04:00
Colin Dellow	7f2042742b	Also compare queries against SQLite itself	2018-03-18 17:49:12 -04:00
Colin Dellow	65ea1b2f61	Rewrite tests for automatic generation Regularize the parquets - nulls and nonulls each come in 3 variants, with 1, 10 and 99 rows per rowgroup. All test queries are written against nullsA, no_nullsA. Next commit will introduce a tool to expand these template queries to go against the actual tables.	2018-03-18 13:11:29 -04:00
Colin Dellow	110e3e3668	row group skipping for is [not] null queries	2018-03-12 21:09:00 -04:00
Colin Dellow	5559a7b563	Fix when last rowgroup is not same size as first ...change test data to use 99 rows, so that when we have rowgroup size 10 we exercise this code.	2018-03-11 15:15:27 -04:00
Colin Dellow	96fcafcd2f	Add test cases	2018-03-10 13:25:13 -05:00
Colin Dellow	b7c134efc0	test-queries: can debug a testcase `tests/test-queries regex` filters the test cases. If the resulting set has only one test case, run it under gdb.	2018-03-10 11:54:36 -05:00
Colin Dellow	35fcde926c	Rewrite SQL oracle harness	2018-03-07 20:20:34 -05:00
Colin Dellow	67005623df	`ensureColumn` catches up when rows are skipped	2018-03-04 22:29:35 -05:00
Colin Dellow	bb3a9440f7	Add query test framework, fix xFilter	2018-03-04 21:05:26 -05:00

11 Commits