When the length of `Tuple/Block data` is greater than 2GB, serialize the protobuf request, embed the
`Tuple/Block data` into the controller attachment, and transmit it through `http brpc`.
This avoids the error raised when the length of the protobuf request exceeds 2GB:
`Bad request, error_text=[E1003]Fail to compress request`.
In #7164, `Tuple/Block data` was put into the attachment and sent via the default `baidu_std brpc`,
but attachments larger than 2GB get truncated; sending via `http brpc` has no 2GB limit.
Also, in #7921 we considered putting `Tuple/Block data` into the attachment by default, since this theoretically
saves one serialization and should improve performance. However, testing found no performance improvement,
while the memory peak increased because of the extra memory copy.
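For reference, a minimal sketch of this path, assuming a generic protobuf request (the function name and the payload framing are illustrative, not the actual Doris wire format). With brpc's `http` protocol the controller attachment is carried as the HTTP body, so the 2GB limit of `baidu_std` attachments does not apply:

```cpp
#include <brpc/channel.h>
#include <brpc/controller.h>
#include <google/protobuf/message.h>
#include <string>

// Hedged sketch: serialize the (small) protobuf request, embed it together
// with the oversized Tuple/Block bytes in the attachment, and POST it over
// the http protocol. How the two parts are framed inside the attachment is
// elided; the receiver has to split them apart again.
bool send_large_block(const std::string& url, const google::protobuf::Message& request,
                      const butil::IOBuf& block_data) {
    brpc::ChannelOptions options;
    options.protocol = brpc::PROTOCOL_HTTP;  // no 2GB attachment limit, unlike baidu_std
    brpc::Channel channel;
    if (channel.Init(url.c_str(), &options) != 0) return false;

    brpc::Controller cntl;
    cntl.http_request().uri() = url;
    cntl.http_request().set_method(brpc::HTTP_METHOD_POST);

    // The attachment becomes the HTTP body, bypassing protobuf
    // serialization/compression of the big payload entirely.
    butil::IOBuf serialized;
    butil::IOBufAsZeroCopyOutputStream wrapper(&serialized);
    request.SerializeToZeroCopyStream(&wrapper);
    cntl.request_attachment().append(serialized);
    cntl.request_attachment().append(block_data);

    channel.CallMethod(nullptr, &cntl, nullptr, nullptr, nullptr /*synchronous*/);
    return !cntl.Failed();
}
```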
1. Add metrics for CPU monitoring;
2. Add metrics for process state monitoring;
3. Add metrics for memory monitoring;
This makes it convenient to filter on different conditions in Grafana.
After this change, the CPU metrics look like this:
doris_be_cpu{device="cpu1",mode="guest_nice"} 0
doris_be_cpu{device="cpu1",mode="guest"} 0
doris_be_cpu{device="cpu1",mode="steal"} 0
doris_be_cpu{device="cpu1",mode="soft_irq"} 107168
doris_be_cpu{device="cpu1",mode="irq"} 0
doris_be_cpu{device="cpu1",mode="iowait"} 3726931
doris_be_cpu{device="cpu1",mode="idle"} 2358039214
doris_be_cpu{device="cpu1",mode="system"} 58699464
doris_be_cpu{device="cpu1",mode="nice"} 1700438
doris_be_cpu{device="cpu1",mode="user"} 54974091
The memory metrics are as follows:
doris_be_memory_pswpin 167785
doris_be_memory_pswpout 203724
doris_be_memory_pgpgin 22308762092
doris_be_memory_pgpgout 152101956232
The process metrics are as follows:
doris_be_proc{mode="interrupt"} 421721020416
doris_be_proc{mode="ctxt_switch"} 2806640907317
doris_be_proc{mode="procs_running"} 8
doris_be_proc{mode="procs_blocked"} 3
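These counters correspond to the Linux sources `/proc/stat` (CPU and process state) and `/proc/vmstat` (pswpin/pswpout/pgpgin/pgpgout). As a minimal sketch, assuming a collector that reads the per-CPU fields directly (struct and function names are illustrative, not the actual BE collector):

```cpp
#include <cstdint>
#include <fstream>
#include <sstream>
#include <string>
#include <vector>

// Illustrative collector: parses the per-CPU lines of /proc/stat, whose
// fields appear in this fixed kernel order.
struct CpuTicks {
    std::string device;  // e.g. "cpu1"
    uint64_t user, nice, system, idle, iowait, irq, soft_irq, steal, guest, guest_nice;
};

std::vector<CpuTicks> read_proc_stat() {
    std::vector<CpuTicks> result;
    std::ifstream in("/proc/stat");
    std::string line;
    while (std::getline(in, line)) {
        // Per-CPU lines look like "cpu1 54974091 1700438 ..."; skip the
        // aggregate "cpu " line and non-cpu lines (intr, ctxt, ...).
        if (line.compare(0, 3, "cpu") != 0 || line[3] == ' ') continue;
        std::istringstream ss(line);
        CpuTicks t;
        ss >> t.device >> t.user >> t.nice >> t.system >> t.idle >> t.iowait
           >> t.irq >> t.soft_irq >> t.steal >> t.guest >> t.guest_nice;
        result.push_back(t);
    }
    return result;
}
```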
Add the ntile window function.
For the non-vectorized engine, I implemented it like Impala does, rewriting `ntile` into `row_number` and `count`.
For the vectorized engine, I implemented `WindowFunctionNTile`.
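For reference, a minimal sketch of the bucket assignment `ntile(n)` has to produce, with the first `count % n` buckets each holding one extra row (the helper is illustrative, not the `WindowFunctionNTile` implementation):

```cpp
#include <cstdint>

// Hypothetical helper: computes the ntile bucket of a row, following the
// SQL-standard semantics. Assumes n >= 1 and 1 <= row_number <= count.
int64_t ntile_bucket(int64_t row_number, int64_t count, int64_t n) {
    int64_t small = count / n;    // size of the smaller buckets
    int64_t remain = count % n;   // the first `remain` buckets get small + 1 rows
    int64_t boundary = remain * (small + 1);
    if (row_number <= boundary) {
        return (row_number - 1) / (small + 1) + 1;  // one of the larger buckets
    }
    return remain + (row_number - boundary - 1) / small + 1;
}
// Example: count = 10, n = 3 yields bucket sizes 4, 3, 3.
```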
In concurrent stream loads, FE runs publish version tasks concurrently,
which can cause the publish tasks to be handled out of order on BE.
For example:
FE publishes tasks with versions 1 2 3 4
BE may handle the tasks in the order 1 2 4 3
In the case above, when reporting tablet info, BE finds that version 4
is published but version 3 is not yet visible, so it reports a version miss to FE;
FE then sets the replica's `lastFailedVersion`, which finally makes transaction
commits fail with no quorum of healthy replicas.
Fix: add a time condition, so that a version miss is only reported to FE after
the version has been missing for 60 seconds.
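A minimal sketch of such a time gate, assuming BE remembers when each missing version was first observed (all names here are hypothetical, not the actual report path):

```cpp
#include <cstdint>
#include <ctime>
#include <map>
#include <utility>

// Hypothetical sketch: remember when a missing version was first observed per
// tablet and only report it to FE once it has been missing for 60 seconds, so
// transiently reordered publish tasks do not get replicas marked with
// lastFailedVersion.
class VersionMissGate {
public:
    bool should_report(int64_t tablet_id, int64_t missing_version) {
        const int64_t now = ::time(nullptr);
        const auto key = std::make_pair(tablet_id, missing_version);
        auto [it, inserted] = _first_seen.emplace(key, now);
        if (inserted) return false;      // just noticed: start the clock
        return now - it->second >= 60;   // still missing after 60s: report
    }

    // Call when the version finally becomes visible on this tablet.
    void clear(int64_t tablet_id, int64_t missing_version) {
        _first_seen.erase(std::make_pair(tablet_id, missing_version));
    }

private:
    std::map<std::pair<int64_t, int64_t>, int64_t> _first_seen;
};
```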
Only one level of array nesting is supported for now.
For example:
- nullable(array(nullable(tinyint))) is **supported**.
- nullable(array(nullable(array(xx)))) is **not supported**.
This CL mainly reduces RPC timeout problems caused by RPCs waiting for brpc worker threads:
1. Merge multiple fragment instances on the same BE into one request, to reduce the number of send-fragment RPCs.
2. If the number of fragments is >= 3, use a 2-phase RPC: phase one sends all fragments, phase two starts them, so that
there are at most 2 RPCs per query on one BE (see the sketch after this list).
3. Set the timeout of the send-fragment RPC to the query timeout, to stay consistent with users' expectation of the query timeout period.
4. Do not close the connection anymore when an RPC timeout occurs.
5. Change some log levels from info to debug to reduce fe.log noise.
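A hypothetical sketch of the dispatch decision in item 2 (the types and method names are made up for illustration; the real logic lives in the FE coordinator):

```cpp
#include <vector>

// Illustrative fragment handle; the real coordinator operates on plan
// fragment parameters.
struct Fragment { int id; };

struct Backend {
    // One-phase call: send and start a batch of fragments in a single RPC.
    void exec_fragments(const std::vector<Fragment>& fs) { /* RPC elided */ }
    // Two-phase calls: first ship all fragments, then start them together.
    void send_fragments(const std::vector<Fragment>& fs) { /* RPC elided */ }
    void start_fragments() { /* RPC elided */ }
};

// At most 2 RPCs per query on one BE: below 3 fragments a single batched
// call suffices; from 3 fragments on, ship everything first, then start.
void dispatch(Backend& be, const std::vector<Fragment>& fragments) {
    if (fragments.size() < 3) {
        be.exec_fragments(fragments);  // 1 RPC
    } else {
        be.send_fragments(fragments);  // RPC 1: deliver all fragments
        be.start_fragments();          // RPC 2: kick them off together
    }
}
```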
NOTICE:
1. The definition of the execPlanFragment RPC is changed; BE must be upgraded first.
2. The FE config `remote_fragment_exec_timeout_ms` is removed.
As described in the issue, compaction and schema change running at the same time may lead to version intersection.
Overview of changes:
1. Do not do compaction before the schema change is actually executed.
2. Set the tablet as bad when it has a version intersection (see the sketch after this list).
3. Do not do the schema change when appropriate versions to delete cannot be found in the new tablet.
4. Do not change rowsets after compaction if the rowsets of the tablet have changed in the meantime.
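A minimal sketch of a version-intersection check as in item 2, assuming each rowset covers an inclusive `[start, end]` version range (names are illustrative, not the tablet code):

```cpp
#include <algorithm>
#include <cstdint>
#include <utility>
#include <vector>

// Illustrative rowset version span: [first, second], both inclusive.
using Version = std::pair<int64_t, int64_t>;

// Two rowsets intersect if, after sorting by start version, one range starts
// before the previous one has ended; such a tablet should be marked bad.
bool has_version_intersection(std::vector<Version> versions) {
    std::sort(versions.begin(), versions.end());
    for (size_t i = 1; i < versions.size(); ++i) {
        if (versions[i].first <= versions[i - 1].second) return true;
    }
    return false;
}
```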
* handle ColumnDictionary in evaluate_or
We need to handle ColumnDictionary in evaluate_or, otherwise the delete handler
would trigger a core dump.
* handle ColumnDictionary in evaluate_and
Because there is only one difference between evaluate_and and evaluate_or
(and vs. or, plus the delete handling), I merged the two macros into one.
Delete handlers also trigger evaluate_and; I am not sure if ColumnDictionary
would be used in evaluate_and.
* clang format
* fix short circuit for evaluate_and and evaluate_or
* clang format
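For illustration, a hedged sketch of the short-circuit semantics the `fix short circuit` commit refers to, over dictionary-encoded codes (signatures are made up; the real predicates operate on column types such as ColumnDictionary):

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical predicate evaluation: `codes[i]` is the dictionary-encoded
// value of row i, `match` the encoded comparison operand, and `flags[i]` the
// accumulated result of previously applied predicates.

// OR: a row already accepted by an earlier predicate can be skipped.
void evaluate_or(const int32_t* codes, size_t n, int32_t match, bool* flags) {
    for (size_t i = 0; i < n; ++i) {
        if (flags[i]) continue;      // short circuit: already true
        flags[i] = (codes[i] == match);
    }
}

// AND: a row already rejected by an earlier predicate can be skipped.
void evaluate_and(const int32_t* codes, size_t n, int32_t match, bool* flags) {
    for (size_t i = 0; i < n; ++i) {
        if (!flags[i]) continue;     // short circuit: already false
        flags[i] = (codes[i] == match);
    }
}
```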