BE crashes when querying a partitioned Hive table in text format with a partition column placed first in the select list (see the example below).
1. FE should use file slots to set the column mapping index of the CSV file.
2. BE should use the block's `get_by_name` to get the correct column in the CSV reader.
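A hypothetical query shape that triggers the crash, assuming a text-format Hive table partitioned by `dt` (all table and column names are made up for illustration):

```sql
-- Text-format Hive table partitioned by `dt`; putting the partition column
-- first in the select list used to crash the BE.
SELECT dt, id, name
FROM hive_catalog.db.text_table
WHERE dt = '2022-01-01';
```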
This PR does two things:
1. 【new logical plan】add **LogicalCheckPolicy** above UnboundRelation in LogicalPlanBuilder.
2. 【new rule】turn **LogicalCheckPolicy** into a LogicalFilter if a row policy exists; otherwise remove it. See the sketch below.
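As an illustration, with a row policy roughly like the one below, the new rule rewrites LogicalCheckPolicy into a LogicalFilter on the policy predicate; for users without a policy the node is simply removed. The policy statement and all names here are a hypothetical sketch, not taken from this PR:

```sql
-- Hypothetical row policy: user u1 may only see rows where region = 'cn'.
CREATE ROW POLICY p1 ON db1.tbl1 AS RESTRICTIVE TO u1 USING (region = 'cn');

-- When u1 runs this, the LogicalCheckPolicy above the relation becomes
-- LogicalFilter(region = 'cn'); for users without a policy it is removed.
SELECT * FROM db1.tbl1;
```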
Use a cache to store external table columns; unique ids for external columns are no longer persisted.
So use -1 as the column id for ES external tables.
This avoids the problem of a non-master FE trying to get a unique id, which would cause the non-master FE to fail to write to bdbje.
1. Support the in-bitmap syntax, like 'where k1 in (select bitmap_column from tbl)'.
2. Support bitmap runtime filters: generate a bitmap filter from the right table's bitmap and push it down to the left table's storage layer for filtering (sketched below).
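A minimal sketch of the new syntax, with hypothetical table and column names:

```sql
-- k1 is probed against the bitmap built from tbl.bitmap_column; the bitmap
-- runtime filter from the right side is pushed down to fact_table's storage layer.
SELECT count(*)
FROM fact_table
WHERE k1 IN (SELECT bitmap_column FROM tbl);
```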
This PR contributes:
- support explain for CTE (example below);
- refine CTE and fix a bug where reusing the same analyzed plan caused different LogicalOlapScan nodes to share the same relationId;
- rename EliminateAliasNode to LogicalSubQueryAliasToLogicalProject and move it to the top of the rewrite stage, so we can easily observe the analyzed plan via the LogicalSubQueryAlias with its alias;
- jobs traverse the left child first, so ExprIds grow from the left child to the right child.
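For instance, explain now works on a query that defines and reuses a CTE (names are hypothetical):

```sql
-- Each reference to the CTE gets its own analyzed plan and relationId
-- instead of reusing the same one.
EXPLAIN
WITH cte AS (SELECT k1, sum(v1) AS s FROM t1 GROUP BY k1)
SELECT a.k1, a.s, b.s
FROM cte a JOIN cte b ON a.k1 = b.k1;
```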
In #14482, we implemented a feature to keep a specific number of metas with the same name in the catalog recycle bin.
But it introduced a meta replay bug.
Every time we drop a db/table/partition, it tries to erase a certain number of metas with the same name,
and when replaying the "drop" edit log it does the same thing. However, the number of metas to erase is based on the current config value
and is not persisted in the edit log, so "drop" and "replay drop" can become inconsistent.
In this PR, I move the "erase metas with the same name" logic to the daemon thread of the catalog recycle bin.
This PR makes sharing the hash table for broadcast joins more robust:
Add a session variable to enable/disable this feature (sketched below).
Do not block the hash join node's close function.
Use shared pointers to share the hash table and runtime filter among broadcast join nodes.
A hash join node that doesn't need to build the hash table closes its right child without reading any data (the child closes the corresponding sender).
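A minimal sketch of toggling the feature; the variable name below is an assumption for illustration, since the PR only says a session variable is added:

```sql
-- Hypothetical variable name; the actual name may differ.
SET enable_share_hash_table_for_broadcast_join = true;   -- share one hash table per BE
SET enable_share_hash_table_for_broadcast_join = false;  -- build a hash table per instance
```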
Issue Number: close #13615
The main work:
implement grouping sets / cube / rollup (example below).
fix the infinite-loop problem of the `if` function.
support falling back to the legacy optimizer for isNull.
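A minimal example of the newly supported grouping syntax (table and columns are hypothetical):

```sql
-- GROUPING SETS / CUBE / ROLLUP are now planned by Nereids.
SELECT k1, k2, sum(v) AS total
FROM t
GROUP BY GROUPING SETS ((k1, k2), (k1), ());

SELECT k1, k2, sum(v) AS total FROM t GROUP BY CUBE (k1, k2);
SELECT k1, k2, sum(v) AS total FROM t GROUP BY ROLLUP (k1, k2);
```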
1. In dateV2, adjust the directory structure to avoid creating a tpch-1G database.
2. Use `drop table XXX` to replace `delete * from XXX where key>0`.
3. Remove explain cases, because
- the explain string itself is variable, and such cases are hard to maintain
- it is the original planner's explain output, not Nereids'
Previously, we used the "Date" type for cooldownTime in StoragePolicy.
But Gson serializes the Date type differently on Java 8 and Java 11, which may cause inconsistent meta errors.
This PR uses Long to store cooldownTime.
Note that in FE the cooldownTime is stored in milliseconds, while in BE it is stored in seconds.
The original code used a Set to store HMS external table partition columns,
which couldn't guarantee the order of the columns.
This could cause column names and column types to be mismatched.
Use a List instead of a Set to fix the problem.
Nereids assigns fragment IDs in its own way, so the fragment ID in explain differs from the fragment ID in the profile.
This difference makes the profile hard to understand.
This PR prints the fragment ID in explain the same as the one in the profile.
Fix the "Thread pool token was shut down" error.
When a query has more than one fragment on a single BE, the thread token may be
reset incorrectly, causing the thread token to be shut down too early.
cherry-pick from master
Introduced by #13021
* [enhancement](sql) Remove unused wide common factors to improve scan performance in ExtractCommonFactorsRule
* fix regression test
Co-authored-by: caiconghui1 <caiconghui1@jd.com>
Originally, Order By Limit returned a maximum of 65535 rows by default during a query,
but many businesses now do not want this limit.
Users had to append a larger LIMIT to the query statement to retrieve the full result,
which is extremely inconvenient, so adjustments have been made.
At the same time, I added the variable DEFAULT_ORDER_BY_LIMIT to SessionVariable
with a default value of -1: if the user does not use the LIMIT keyword, or the limit value is a negative integer,
the query returns up to Long.MAX_VALUE rows by default. If a maximum query value is set,
the number of rows returned follows that maximum or the value after the LIMIT keyword.
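A sketch of the resulting behavior; the lowercase session-variable name is assumed from DEFAULT_ORDER_BY_LIMIT, and the table is hypothetical:

```sql
-- Default (-1): ORDER BY without LIMIT returns all rows
-- (internally capped at Long.MAX_VALUE) instead of 65535.
SET default_order_by_limit = -1;
SELECT * FROM t ORDER BY k1;

-- With a cap set, ORDER BY without LIMIT returns at most that many rows;
-- an explicit LIMIT still takes its own value.
SET default_order_by_limit = 65535;
SELECT * FROM t ORDER BY k1;
SELECT * FROM t ORDER BY k1 LIMIT 10;
```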
For the table name check, both a regex mismatch and a name exceeding the length limit display the message "Incorrect table name 'xxx'. Table name regex is 'xxx'".
Obviously, this message cannot clearly indicate which kind of error occurred.
So it is better to separate the two error messages.
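Two hypothetical statements that hit the two different checks (the table schema and property values are just for illustration):

```sql
-- Fails the regex check: the name contains an illegal character.
CREATE TABLE `bad-name` (k1 INT)
DISTRIBUTED BY HASH(k1) BUCKETS 1
PROPERTIES ("replication_num" = "1");

-- Fails the length check: the name exceeds the allowed length limit.
CREATE TABLE `aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa` (k1 INT)
DISTRIBUTED BY HASH(k1) BUCKETS 1
PROPERTIES ("replication_num" = "1");
```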
# Proposed changes
- Refactor AggregateFunction
  1. AggregateFunction implements ComputeSignature.
  2. Add a CustomSignature to compute signatures dynamically; we can check input types and compute implicit cast types in the `customSignature` method.
  3. Add PartialAggType to record some type information before disassembling the aggregate.
  4. Refine and create a custom catalog function when translating AggregateFunction, without `finalizeForNereids`.
- Support explain plan (examples below)
  1. explain parsed plan select ...
  2. explain analyzed plan select ...
  3. explain rewritten/logical plan select ...
  4. explain optimized/physical plan select ...
  5. explain all plan select ...
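The new explain levels, shown on a hypothetical query:

```sql
-- Each level prints the Nereids plan at a different stage of the pipeline.
EXPLAIN PARSED PLAN    SELECT k1, sum(v) FROM t GROUP BY k1;
EXPLAIN ANALYZED PLAN  SELECT k1, sum(v) FROM t GROUP BY k1;
EXPLAIN REWRITTEN PLAN SELECT k1, sum(v) FROM t GROUP BY k1;
EXPLAIN OPTIMIZED PLAN SELECT k1, sum(v) FROM t GROUP BY k1;
EXPLAIN ALL PLAN       SELECT k1, sum(v) FROM t GROUP BY k1;
```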