doris

Author	SHA1	Message	Date
Mingyu Chen	c43d0e2a75	[Tablet report] Fix bug that tablet report throw NPE. (#2578 ) When processing tablet reports, some tablets carry transaction information. This information is used by the FE to determine whether to publish these transactions or clear these transactions. During this process, Doris may try to obtain the commit information of some deleted partitions, resulting in a null pointer exception.	2019-12-26 15:31:36 +08:00
kangpinghuang	ee64ab55db	Fix segment size (#2549 )	2019-12-26 11:51:53 +08:00
Mingyu Chen	6f3c50a95c	[Document] Add example for using CTE in INSERT operation (#2572 )	2019-12-26 10:00:34 +08:00
ZHAO Chun	37f2dccc96	Support bitshuffle on aarch64 (#2574 )	2019-12-25 22:21:46 +08:00
ZHAO Chun	a76333a400	Support s2 on aarch64 (#2568 )	2019-12-25 18:56:52 +08:00
ZHAO Chun	35503cf8a3	Support glog on aarch64 (#2563 )	2019-12-25 13:56:15 +08:00
HangyuanLiu	4ff1299e0b	Fix ORC build-thirdpart.sh (#2564 )	2019-12-25 11:00:13 +08:00
ZHAO Chun	c8173c689a	Support Openssl on aarch64 platform (#2561 )	2019-12-25 10:53:47 +08:00
HangyuanLiu	6444187908	Fix Bug : Load parquet data during the upgrade may result in data errors (#2556 )	2019-12-24 23:27:33 +08:00
kangpinghuang	7f48bd3c5a	Support bloom filter index for large int type (#2550 )	2019-12-24 19:04:03 +08:00
kangpinghuang	f9685372a1	Fix bloom filter bug #2526 (#2532 )	2019-12-24 07:45:11 +08:00
Mingyu Chen	a511042397	[Export] Forget to set timeout for export job (#2516 )	2019-12-23 18:14:41 +08:00
firetree01	e7be52fa58	Update basic-usage_EN.md (#2530 )	2019-12-23 16:04:27 +08:00
Lishi	20abfc5f6f	Modify stream-load-manual_EN.md (#2528 )	2019-12-23 15:34:19 +08:00
yangzhg	5ff5bf20c9	Fix core dump when using datetime in window function (#2482 )	2019-12-23 09:38:37 +08:00
kangpinghuang	b4d935ab37	Fix compaction with delete rowset bug (#2523 ) [STORAGE][SEGMENTV2] when base compaction rowsets with delete rowset of more than two condition, stats rows_del_filtered is wrong and compaction will fail because of line check.	2019-12-21 12:13:46 +08:00
frwrdt	008e59476d	Add curdate function doc (#2520 )	2019-12-20 21:24:56 +08:00
HangyuanLiu	5b9b0a84d5	Add curdate function (#2521 )	2019-12-20 21:23:16 +08:00
HangyuanLiu	11b78008cd	Timezone variable support one digital time (#2513 ) Support time zone variable like "-8:00","+8:00","8:00" Time zone variable like "-8:00" is illegal in time-zone ID ,so we mush transfer it to standard format	2019-12-20 07:45:29 +08:00
kangkaisen	6815979ba5	Fix invalid to_bitmap input lead to BE core (#2510 )	2019-12-19 21:28:00 +08:00
Mingyu Chen	5111f8cfe8	[Export] Fix bug that NPE may be thrown when executing "show export;" (#2509 ) Some export job from old version of Doris may not has timeout property, which will cause NPE. 2 more changes: 1. Change the default BE config "max_runnings_transactions" to 2000. 2. Add a new metric to FE to show the master ip:port.	2019-12-19 19:09:25 +08:00
EmmyMiao87	49b8097495	Fix the core of get_next in exchange node (#2505 ) The _input_batch hasn't been initialized in exchange node. The undefined behavior will cause that the BE wants to get the capacity of input_batch before BE initialize it. The issue is #2504	2019-12-19 16:40:33 +08:00
vinson0526	435fdd236e	Fix npe in spark-doris-connector when query is complex (#2503 )	2019-12-19 14:53:29 +08:00
HangyuanLiu	45fa9c999e	Add Apache ORC lib in Doris (#2479 )	2019-12-19 11:09:49 +08:00
EmmyMiao87	4220e3b3dc	Merge pull request #2486 from EmmyMiao87/assert_node Only specified function could be supported in correlated subquery	2019-12-19 10:21:06 +08:00
emmymiao87	53132b4199	Chnange the name of specified agg function	2019-12-18 19:35:49 +08:00
WingC	e1ff744a99	[Alter Job] Cancel the alter job after a task failed for 3 times (#2447 ) To avoid waiting timeout when it is a invalid alter job.	2019-12-18 19:17:34 +08:00
emmymiao87	8342eb0b02	Only UDA function could be supported in correlated subquery Those query of issue could not be supported. #2483 #2493 Those query is forbidden: query1: select * from t1 where k1=(select k1 from t2 where t1.k2=t2.k2); query2: select * from t1 where k1=(select distinct k1 from t2 where t1.k2=t2.k2); Only sum, max, min, avg and count function could appear on select clause for correlated subquery. #2420 Those query is legal: query1: select * from t1 where k1=(select avg(k1) from t2 where t1.k2=t2.k2);	2019-12-18 18:56:48 +08:00
kangpinghuang	63ea05f9c7	Add convert tablet rowset type (#2294 ) to solve the issue #2246. scheme is as following: add a optional preferred_rowset_type in TabletMeta for V2 format rollup index tablet add a boolean session variable use_v2_rollup, if set true, the query will v2 storage format rollup index to process the query. test queries will be sent to online service to verify the correctness of segment-v2 by send the the same queries to fe with use_v2_rollup set or not to check whether the returned results are the same.	2019-12-18 18:49:47 +08:00
Youngwb	48f559600f	Fix bug when spark on doris run long time (#2485 )	2019-12-18 13:08:21 +08:00
Mingyu Chen	222f8390c7	[Compaction] Fix the bug that cumulative point grows unreasonably (#2490 ) When there are to many segment in one rowset, which is larger than BE config 'max_cumulative_compaction_num_singleton_deltas', the cumulative compaction will not work and just increase the cumulative point, because there is only once rowset being selected. So when selecting rowset for cumulative compaction, we should meet 2 requirments before finishing the selection logic: 1. compaction score is larger than 'max_cumulative_compaction_num_singleton_deltas' 2. at least 2 rowsets are selected.	2019-12-18 12:59:17 +08:00
WingC	c81b1db406	Support convert VARCHAR type to DATE type (#2489 )	2019-12-18 12:58:47 +08:00
kangpinghuang	d31f774852	Add block split bloom filter (#2471 ) [STORAGE][SEGMENTV2] use block split bloom filter build bloom filter against data page add distinct value to bloom filter add ordinal index to bloom filter index	2019-12-18 12:57:44 +08:00
EmmyMiao87	efd32f7a85	Remove unused import package (#2492 )	2019-12-18 10:55:56 +08:00
WingC	89003b774b	Support Convert Varchar to INT (#2481 )	2019-12-17 22:02:28 +08:00
EmmyMiao87	b1bac4d0cd	Support to create materialized view (#2431 ) Support to create materialized view This commit support to create materiliazed view. The syntax of stmt is following: CREATE Materialized View [MV name] AS SELECT select_expr[, select_expr ...] FROM [Base table name] GROUP BY column_name[, column_name ...] ORDER BY column_name[, column_name ...] The CreateMaterializedViewClause is used to check the semantic of stmt in the first step. Now, the where, having, limit clause is forbidden in CREATE MATERIALIZED VIEW. Also the aggregation function is restricted in SUM/MIN/MAX. The second step is to validate stmt according to metadata of base table. For example, the aggregate type of mv column must be same as the aggregate type of base column in aggregate table. The last step is to prepare index of mv and add this new mvJob in Handler. The handler will asynchronous process this new mvJob.	2019-12-17 21:12:24 +08:00
emmymiao87	3e58e2d543	Forbidden the distinct function of subquery in binary predicate	2019-12-17 19:38:15 +08:00
Mingyu Chen	e1ba0efbc7	Optimize compaction strategy of tablet on BE (#2473 ) The current compaction selection strategy and cumulative point update logic will cause the cumulative compaction to not work, and all compaction tasks will be completed only by the base compaction. This can cause a large number of data versions to pile up. In the current cumulative point update logic, when a cumulative cannot select enough number of rowsets, it will directly increase the cumulative point. Therefore, when the data version generates the same speed as the cumulative compaction polling, it will cause the cumulative point to continuously increase without triggering the cumulative compaction. The new strategy mainly modifies the update logic of cumulative point to ensure that the above problems do not occur. At the same time, the new strategy also takes into account the problem that compaction cannot be performed if cumulative points stagnate for a long time. Cumulative points will be forced to increase through threshold settings to ensure that compaction has a chance to execute. Also add a new HTTP API to view the compaction status of specified tablet. See `compaction-action.md` for details.	2019-12-17 10:30:43 +08:00
landon-dai	55cb1cd1f1	Update date_format.md (#2476 )	2019-12-16 20:43:55 +08:00
landon-dai	b20a76163b	Update from_unixtime.md (#2475 )	2019-12-16 19:39:54 +08:00
kangkaisen	9244db40f7	Update bitmap doc (#2467 )	2019-12-16 18:56:53 +08:00
EmmyMiao87	2c90915362	Support correlated non-scalar subquery (#2468 ) The first item of non-scalar subquery could be non-aggregation function such as column k1. This commit remove this prohibit.	2019-12-16 18:52:05 +08:00
kangkaisen	d00c5e3066	Fix base_compaction minor log error (#2461 )	2019-12-16 13:45:19 +08:00
xy720	c8c32658a7	Fix PIPE operator priority (#2459 ) This commit will promote the priority of the \|\| operator to the front of the + - * / mod operators. It solves the problems 2.1 that mentioned at issue #2396 . For problem at 2.2 in issue #2396 , it is actually the same problem mentioned in issue #2142 . As it said in pr #2398 before, the influence of modifying that logic will cause semantic errors in insert and load, so this commit will left the bug unsolved temporary. appendix: In Mysql 5.7.27 \|\| and \| select 23\|1\|\|7; 23 select (23\|1)\|\|7 237 select 23\|(1\|\|7) 23 Priority : \|\| > \| \|\| and & select 10&1\|\|7; 0 select (10&1)\|\|7 7 select 10&(1\|\|7) 0 Priority : \|\| > & \|\| and ^ select 10^1\|\|7 27 select (10^1)\|\|7 117 select 10^(1\|\|7) 27 Priority : \|\| > ^ \|\| and ~ select ~1\|\|7 184467440737095516147 select ~(1\|\|7) 18446744073709551598 priority : \|\| < ~	2019-12-16 13:44:49 +08:00
Mingyu Chen	e65a645138	Add classes related to "tag". (#2343 ) [Tag System] This CL includes 2 parts: Add classes related to "tag" Resource: is the collective name of the nodes that provide various service capabilities in Doris cluster. Tag: A Tag consists of type and name. TagSet: TagSet represents a set of tags. TagManager: maintains 2 indexes: one is from tag to resource. one is from resource to tags ISSUE #1723 Using JSON as serialization methods of metadata Introduce GSON library to serialize the new classes mentioned above. ISSUE #2415 #2389 GSON's version is updated to 2.8.6	2019-12-15 20:13:29 +08:00
Seaven	e4cc17599f	Add plugin definition (#2351 )	2019-12-13 21:38:17 +08:00
kangkaisen	02c4edb98e	Add more HTTP log (#2458 )	2019-12-13 21:31:48 +08:00
Yunfeng,Wu	a17b28ccc1	Modify FE QueryPlan UT test failure by accident (#2455 )	2019-12-13 21:28:54 +08:00
kangkaisen	cf6d705df9	Add intersect_count UDAF (#2418 ) 1 Because we don't support array type currently, so I use variable arguments instead. 2 intersect_count directly return final count, not bitmap like bitmap_union, because intersect_count return bitmap is more complex and need more serialize. If we really need bitmap format from intersect_count, we could do that in another PR and which won't have compatibility problems.	2019-12-13 16:12:05 +08:00
ZHAO Chun	5b3f61f26d	Update README.md (#2452 ) Change default docker image from build-env to build-env-1.1	2019-12-13 15:55:41 +08:00

1 2 3 4 5 ...

1321 Commits