doris

Author	SHA1	Message	Date
airborne12	347cceb530	[Feature](inverted index) push count on index down to scan node (#22687 ) Co-authored-by: airborne12 <airborne12@gmail.com>	2023-09-02 22:24:43 +08:00
zzzxl	fe6bae2924	[fix](invert index) supports utf8 and non-utf8 strings (#22570 ) supports utf8 and non-utf8 strings: [fix] compatible with utf8 and invalid utf8 doris-thirdparty#110	2023-08-05 12:52:53 +08:00
Kang	7fe08c74fe	[fix](inverted index) return empty result instead of error for empty match query (#22592 ) return empty result instead of error for empty match query as follows: `SELECT * FROM t WHERE msg MATCH ''` `SELECT * FROM t WHERE msg MATCH 'stop_word'`	2023-08-04 17:36:32 +08:00
airborne12	0cd5183556	[Refactor](inverted index) refact tokenize function for inverted index (#22313 )	2023-08-02 19:12:22 +08:00
airborne12	fc2b9db0ad	[Feature](inverted index) add tokenize function for inverted index (#21813 ) In this PR, we introduce TOKENIZE function for inverted index, it is used as following: ``` SELECT TOKENIZE('I love my country', 'english'); ``` It has two arguments, first is text which has to be tokenized, the second is parser type which can be english, chinese or unicode. It also can be used with existing table, like this: ``` mysql> SELECT TOKENIZE(c,"chinese") FROM chinese_analyzer_test; +---------------------------------------+ \| tokenize(`c`, 'chinese') \| +---------------------------------------+ \| ["来到", "北京", "清华大学"] \| \| ["我爱你", "中国"] \| \| ["人民", "得到", "更", "实惠"] \| +---------------------------------------+ ```	2023-07-25 15:05:35 +08:00
YueW	c01230f99a	[fix](match) Optimize the logic for match_phrase function filter (#21622 )	2023-07-25 14:22:37 +08:00
Mryange	b656f31cf2	[Enchancement](compatible) show decimalv3 to decimal (#21782 )	2023-07-18 09:17:14 +08:00
YueW	00c48f7d46	[opt](regression case) add more index change case (#21734 )	2023-07-12 21:52:48 +08:00
YueW	c58d5cd81b	[opt](regression case) add more index change regression case (#21633 )	2023-07-08 22:23:09 +08:00
Gabriel	5506faa7b4	[datetimev2](minor) Add scale parameter for datetimev2 (#21176 )	2023-06-27 19:55:35 +08:00
airborne12	1ac8cdec7e	[Fix](inverted index) fix inverted query cache for chinese tokenizer (#21106 ) 1. query cache for chinese tokenizer is confusing when just converting w_char to char. 2. seperate query_type from inverted_index_reader to clean code.	2023-06-25 22:04:02 +08:00
airborne12	6ac0bfeceb	[Feature](inverted index) add unicode parser for inverted index (#21035 )	2023-06-21 20:14:06 +08:00
zhannngchen	cc53391c9a	Revert "[feature](merge-on-write) enable merge on write by default (#… (#21041 )	2023-06-21 18:36:46 +08:00
YueW	656b9ad3da	[enhancement](index) Nereids support no need to read raw data for index column that only in filter conditions (#20605 )	2023-06-09 21:54:48 +08:00
YueW	b62c5a70c7	[fix](match query) fix array column match query failed without inverted index (#20344 )	2023-06-02 21:10:12 +08:00
YueW	adc3acb283	[fix](match) fix match query with compound predicates return -6003 (#20361 )	2023-06-02 18:25:37 +08:00
YueW	6adb3fdf11	[fix](match_phrase) Fix the inconsistent query result for 'match_phrase' after creating index without support_phrase property (#20258 ) if create inverted index without support_phrase property, remaining the match_phrase condition to filter by match function.	2023-05-31 18:09:50 +08:00
airborne12	90b4e127e3	[Feature](inverted index) add parser_mode properties for inverted index parser (#20116 ) We add parser mode for inverted index, usage like this: ``` CREATE TABLE `inverted` ( `FIELD0` text NULL, `FIELD1` text NULL, `FIELD2` text NULL, `FIELD3` text NULL, INDEX idx_name1 (`FIELD0`) USING INVERTED PROPERTIES("parser" = "chinese", "parser_mode" = "fine_grained") COMMENT '', INDEX idx_name2 (`FIELD1`) USING INVERTED PROPERTIES("parser" = "chinese", "parser_mode" = "coarse_grained") COMMENT '' ) ENGINE=OLAP ); ```	2023-05-29 23:21:52 +08:00
Gabriel	55ccddb62c	[Conf](decimalv3) enable decimalv3 by default	2023-05-29 15:38:31 +08:00
YueW	ae352997b4	[Enhancement](alter inverted index) Improve alter inverted index performance with light weight add or drop inverted index (#19063 )	2023-05-28 11:23:07 +08:00
amory	ee34b6de2d	[Refact] (serde) refact mysql serde with data type (#19543 ) refact mysql output (de)serialize with data type serde , avoid accoriding switch case Primitive type writed in mysqlWriter	2023-05-26 14:11:17 +08:00
zhannngchen	5167dc1251	[feature](merge-on-write) enable merge on write by default (#19017 )	2023-05-11 11:10:48 +08:00
Kang	98b8bef05b	[bugfix](inverted index) fix inverted index to support NULL value filter (#18302 )	2023-04-15 13:20:26 +08:00
YueW	1f9372558d	[improve](regression case) Add more inverted index regression case (#18589 ) 1. add more inverted index regression case for unique mow 2. add inverted index case with different data types	2023-04-12 20:40:55 +08:00
YueW	0b8bc51b72	[fix](inverted index) Fix key column match query failed (#18436 ) * [fix](inverted index) Fix key column match query failed * [chore](regression case) add regression case * [fix] fix regression case no order by	2023-04-08 15:45:08 +08:00
Pxl	e77833bfa1	[Bug](materialized-view) fix where clause persistence replay incorrect (#18228 ) fix where clause persistence replay incorrect	2023-04-03 12:49:01 +08:00
YueW	5f2dad29ca	[enhancement](inverted index) Support inverted index without specified parser to use match query (#17110 )	2023-02-24 20:34:55 +08:00
YueW	43eca4f209	[Feature-WIP](inverted index) Implementation for alter inverted index. (#16371 ) implementation for add/drop inverted index.	2023-02-10 17:56:17 +08:00
YueW	bb179b77f7	[Feature-WIP](inverted index) support array type for inverted index reader (#16355 )	2023-02-02 16:14:14 +08:00

29 Commits