doris

Author	SHA1	Message	Date
Xinyi Zou	e17aef9467	[refactor] refactor the implement of MemTracker, and related usage (#8322 ) Modify the implementation of MemTracker: 1. Simplify a lot of useless logic; 2. Added MemTrackerTaskPool, as the ancestor of all query and import trackers, This is used to track the local memory usage of all tasks executing; 3. Add cosume/release cache, trigger a cosume/release when the memory accumulation exceeds the parameter mem_tracker_consume_min_size_bytes; 4. Add a new memory leak detection mode (Experimental feature), throw an exception when the remaining statistical value is greater than the specified range when the MemTracker is destructed, and print the accurate statistical value in HTTP, the parameter memory_leak_detection 5. Added Virtual MemTracker, cosume/release will not sync to parent. It will be used when introducing TCMalloc Hook to record memory later, to record the specified memory independently; 6. Modify the GC logic, register the buffer cached in DiskIoMgr as a GC function, and add other GC functions later; 7. Change the global root node from Root MemTracker to Process MemTracker, and remove Process MemTracker in exec_env; 8. Modify the macro that detects whether the memory has reached the upper limit, modify the parameters and default behavior of creating MemTracker, modify the error message format in mem_limit_exceeded, extend and apply transfer_to, remove Metric in MemTracker, etc.; Modify where MemTracker is used: 1. MemPool adds a constructor to create a temporary tracker to avoid a lot of redundant code; 2. Added trackers for global objects such as ChunkAllocator and StorageEngine; 3. Added more fine-grained trackers such as ExprContext; 4. RuntimeState removes FragmentMemTracker, that is, PlanFragmentExecutor mem_tracker, which was previously used for independent statistical scan process memory, and replaces it with _scanner_mem_tracker in OlapScanNode; 5. MemTracker is no longer recorded in ReservationTracker, and ReservationTracker will be removed later;	2022-03-11 22:04:23 +08:00
Mingyu Chen	a76889b319	[improvement] Avoid print large string in error log (#8436 ) 1. Avoid print large string in error log If user load a unqualified large string, the all string will be saved in error log, so the error log is too big that can not be shown be using `show load warnings on "url"`. Err: `Got packet bigger than 'max_allowed_packet' bytes` 2. Remove duplicate help doc Do not allow doc with same title, or error thrown when starting FE: `java.lang.IllegalArgumentException: Multiple entries with same key:`	2022-03-11 17:23:47 +08:00
zhangstar333	e0ef9b8f6c	[refactor](vectorized) to_bitmap(-1) return NULL instead of return parse failed error_message (#8373 )	2022-03-11 17:21:47 +08:00
Mingyu Chen	e403dbc38c	[feature](user-property) Support user level exec_mem_limit and load_mem_limit (#8365 ) ``` SET PROPERTY FOR 'jack' 'exec_mem_limit' = '2147483648'; SET PROPERTY FOR 'jack' 'load_mem_limit' = '2147483648'; ``` The user level property will overwrite the value in session variables.	2022-03-11 17:20:09 +08:00
Zhengguo Yang	37c201adea	[doc] fix metadata operation links to external docs (#8403 )	2022-03-11 09:34:10 +08:00
morrySnow	548fde60e5	[doc] Fix dead link in build docker environment in developer guide (#8379 )	2022-03-10 09:06:39 +08:00
HappenLee	8c8416026d	[doc](vectorized) Add vectorized execution engine docs (#8358 )	2022-03-10 09:06:08 +08:00
steadyBoy	3d30209e9b	[docs] add document conditional-functions (#8339 ) Add conditional-functions.	2022-03-09 13:05:13 +08:00
Mingyu Chen	1e70f992e7	[improvement][fix](insert)(replay) support SHOW LAST INSERT stmt and fix json replay bug (#8355 ) 1. support SHOW LAST INSERT In the current implementation, the insert operation returns a json string to describe the result information of the insert. But this information is in the session track field of the mysql protocol, and it is difficult to obtain programmatically. Therefore, I provide a new syntax `show last insert` to explicitly obtain the result of the latest insert operation, and return a normal query result set to facilitate the user to obtain the result information of the insert. 2. the `ReturnRows` field in fe.audit.log of insert operation will be set to the loaded row num of the insert. 3. Fix a bug described in #8354	2022-03-08 18:53:11 +08:00
Zhengguo Yang	3eedd15f9c	[optimize] optimze tablet read, avoid to create too much scanner for small tablet (#8096 )	2022-03-08 13:59:45 +08:00
wunan1210	ca3be24a86	[typo]fix some typo in fe_config (#8325 )	2022-03-07 23:18:38 +08:00
Mingyu Chen	c647def5bc	[community] Modify doris connector release doc (#8275 )	2022-03-07 23:15:38 +08:00
dataalive	1879623521	[docs]update http port doc to be more intuitive (#8343 ) * update http port doc to be more intuitive (#8343)	2022-03-06 16:55:17 +08:00
jiafeng.zhang	ea4ad1a99c	[doc] Add sync job fe configuration item description (#8349 )	2022-03-06 13:49:59 +08:00
Lijia Liu	22a0011403	[fix](planner) Convert format in RewriteFromUnixTimeRule (#8235 ) SQL to reproduce: ``` SELECT * FROM table WHERE where FROM_UNIXTIME(d_datekey,'%Y-%m-%d %H:%i:%s') != '1970-08-20 00:11:43'; org.apache.doris.common.AnalysisException: errCode = 2, detailMessage = Unexpected exception: Illegal pattern character 'i' at org.apache.doris.qe.StmtExecutor.analyze(StmtExecutor.java:584) ~[palo-fe.jar:3.4.0] at org.apache.doris.qe.StmtExecutor.execute(StmtExecutor.java:345) ~[palo-fe.jar:3.4.0] at org.apache.doris.qe.StmtExecutor.execute(StmtExecutor.java:318) ~[palo-fe.jar:3.4.0] at org.apache.doris.qe.ConnectProcessor.handleQuery(ConnectProcessor.java:221) ~[palo-fe.jar:3.4.0] at org.apache.doris.qe.ConnectProcessor.dispatch(ConnectProcessor.java:361) ~[palo-fe.jar:3.4.0] at org.apache.doris.qe.ConnectProcessor.processOnce(ConnectProcessor.java:562) ~[palo-fe.jar:3.4.0] at org.apache.doris.mysql.nio.ReadListener.lambda$handleEvent$0(ReadListener.java:50) ~[palo-fe.jar:3.4.0] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) [?:?] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) [?:?] at java.lang.Thread.run(Thread.java:835) [?:?] ``` Describe the overview of changes. Just support: yyyy-MM-dd HH:mm:ss yyyy-MM-dd yyyyMMdd	2022-03-05 15:26:37 +08:00
ChPi	f57f02bbf2	[improvement] Support show tablets stmt (#7970 ) change `show tablet from tbl` to `show tablets from tbl`	2022-03-05 15:25:57 +08:00
Zhang Yifan	61455ec094	[docs] fix invalid links in docker-dev document (#8313 )	2022-03-05 14:39:37 +08:00
wangbo	450273cfc2	[typo]update spark build doc (#8333 ) [typo]update spark build doc (#8333)	2022-03-05 12:30:56 +08:00
caiconghui	0383001442	[Enhancement] Support Skipping compaction lower replica where select queryable replica for better scan performance (#8146 )	2022-03-05 09:51:50 +08:00
caiconghui	46ca23f216	[Feature] Support Changing the bucketing mode of the table from Hash Distribution to Random Distribution (#8259 ) Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2022-03-04 09:05:23 +08:00
Zhengguo Yang	09bfb8b9d3	[fix] (rpc-udf) Fixed the problem that the query could not be interrupted (#8248 ) if an error occurred in the rpc server during the execution of rpc-udf. Add java,cpp,python demo of rpc-udf server	2022-03-03 09:30:03 +08:00
aopangzi	d5b6428c6d	[improvement] Upgrade MySQL version to 5.7.37 to reduce unnecessary CVE issues (#8247 )	2022-03-02 10:16:02 +08:00
weizuo93	236105daa0	[feature][show-transaction] Support view transactions info for specified status by `SHOW TRANSACTION` stmt (#8156 ) SHOW TRANSACTION WHERE STATUS = 'prepare/precommitted/committed/visible/aborted';	2022-03-02 10:14:42 +08:00
steadyBoy	5cc62fa166	[docs] fix document date-time-functions typo (#8053 ) 1. en: add convert_tz.md 2. en: fix curdate.md, the fix is to modify the title 3. en: add curtime.md 4. en: fix str_to_date.md, the fix is to modify the title 5. zh-CN: fix convert_tz.md, the fix is to modify the description 6. zh-CN: fix curdate.md, the fix is to modify the title	2022-03-02 10:07:44 +08:00
zhangstar333	2b9b0fc1ec	[Fix] Function percentile input null return null (#8238 )	2022-03-01 14:42:48 +08:00
Mingyu Chen	13f007bfd5	[doc] Modify document of compilation on ARM64 (#8254 )	2022-03-01 11:05:18 +08:00
daikon	30fbf1da68	[typo](doc)fix some confusing doc content (#8239 )	2022-02-26 17:05:08 +08:00
caiconghui	83521a826a	[Feature](create_table) Support create table with random distribution to avoid data skew (#8041 ) In some scenarios, users cannot find a suitable hash key to avoid data skew, so we need to provide an additional data distribution for olap table to avoid data skew example: CREATE TABLE random_table ( siteid INT DEFAULT '10', citycode SMALLINT, username VARCHAR(32) DEFAULT '', pv BIGINT SUM DEFAULT '0' ) AGGREGATE KEY(siteid, citycode, username) DISTRIBUTED BY random BUCKETS 10 PROPERTIES("replication_num" = "1"); Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2022-02-26 10:38:55 +08:00
caiconghui	a630e037b9	[Enhancement](routine_load) Support show routine load statement with like predicate (#8188 ) * [Enhancement](routine_load) Support show routine load with like predicate Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2022-02-26 10:35:38 +08:00
Pxl	4c5d7c27df	[Bug] group_concat(value,null) not return null	2022-02-25 11:03:23 +08:00
Xujian Duan	f4c6de554b	[docs] update backup and restore docs (#8161 ) Co-authored-by: 段绪建 <duanxujian@jd.com>	2022-02-23 11:41:14 +08:00
jiafeng.zhang	c3348b8023	[docs] fix config enable_force_drop_redundant_replica name not correct in docs (#8154 )	2022-02-21 09:40:21 +08:00
Lonre Wang	16bf60e0da	[docs] Update data-model-rollup.md (#8126 )	2022-02-21 09:36:31 +08:00
zhannngchen	826738d97f	[docs]Some doc improvements and typo fix (#8153 )	2022-02-21 09:36:01 +08:00
Stalary	b27122df8b	[Docs] add rpc function document (#7984 ) * add rpc function document	2022-02-20 23:26:52 +08:00
Mingyu Chen	13d217b947	[community](doc) Refactor the release and verify doc (#8136 ) Because of the latest moving of some code to a new repository, the documentation for release and verification needs to be reorganized. There are 5 relevant documents as follows. 1. release-prepare.md General instructions for the release and related preparation work. 2. release-doris-core.md The Doris Core release process 3. release-doris-connectors The Doris Connectors release process 4. release-complete.md Release completion after release polling passed. 5. release-verify.md Release verification methods.	2022-02-20 16:31:26 +08:00
dataalive	02531cc536	[doc]fix start/stop command (#8150 )	2022-02-20 12:03:47 +08:00
Pxl	1943a5da5f	[docs] fix some document error and adjust some function document location 1. move `group_concat` from string-functions to aggregate-functions. 2. add `json_array`/`json_object`/`json_quote` to sidebar file. 3. move `json_array`/`json_object`/`json_quote`/`get_json_double`/`get_json_int`/`get_json_string` to json-functions. 4. change `group_concat` document to uppercase	2022-02-19 11:59:55 +08:00
EmmyMiao87	2b1aec871d	[docs] Modify the SSB document (#8101 ) Add one step of preparation before load	2022-02-18 11:14:20 +08:00
jiafeng.zhang	7471873e6f	Add version upgrade instructions (#8057 ) Add version upgrade instructions	2022-02-17 18:06:41 +08:00
jiafeng.zhang	289aacb78c	[improvement] enable check_java_version (#8034 ) Enable to check the Java version when Doris starts, to prevent the user experience caused by the inconsistency between the compiled version and the running version. If the Java version is compiled and the Java version is run, it will not start, and a prompt message will be given.	2022-02-17 11:16:45 +08:00
Mingyu Chen	486a0586ac	[community] modify the doc of verifying apache release (#8084 )	2022-02-17 10:53:31 +08:00
GRX	d9535c29f6	[doc] update alter table docs (#8076 )	2022-02-17 10:52:59 +08:00
Mingyu Chen	26289c28b0	[fix](load)(compaction) Fix NodeChannel coredump bug and modify some compaction logic (#8072 ) 1. Fix the problem of BE crash caused by destruct sequence. (close #8058) 2. Add a new BE config `compaction_task_num_per_fast_disk` This config specify the max concurrent compaction task num on fast disk(typically .SSD). So that for high speed disk, we can execute more compaction task at same time, to compact the data as soon as possible 3. Avoid frequent selection of unqualified tablet to perform compaction. 4. Modify some log level to reduce the log size of BE. 5. Modify some clone logic to handle error correctly.	2022-02-17 10:52:08 +08:00
jiafeng.zhang	79fd81f035	[doc] Added be -238 error code description (#8048 ) Added be -238 error code description	2022-02-17 10:47:52 +08:00
lihuigang	264f38471c	[feature](spark-load) add Hive Bitmap UDFs (#8036 ) Hive Bitmap UDF provides UDFs for generating bitmap and bitmap operations in hive tables. The bitmap in Hive is exactly the same as the Doris bitmap. The bitmap in Hive can be imported into Doris through spark bitmap load.	2022-02-17 10:45:20 +08:00
caiconghui	e6fedff68f	[Refactor][heartbeat] Make get fe heart response by thrift (#8035 ) * [Refactor] Make get fe heart response by thrift Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2022-02-17 10:25:51 +08:00
weizuo93	a6bf8c13eb	[Feature](Transaction) Support two phase commit (2PC) for stream load (#7473 ) The two phase batch commit means： During Stream load, after data is written, the message will be returned to the client, the data is invisible at this point and the transaction status is PRECOMMITTED. The data will be visible only after COMMIT is triggered by client. 1. User can invoke the following interface to trigger commit operations for transaction： curl -X PUT --location-trusted -u user:passwd -H "txn_id:txnId" -H "txn_operation:commit" \ http://fe_host:http_port/api/{db}/_stream_load_2pc or curl -X PUT --location-trusted -u user:passwd -H "txn_id:txnId" -H "txn_operation:commit" \ http://be_host:webserver_port/api/{db}/_stream_load_2pc 2.User can invoke the following interface to trigger abort operations for transaction： curl -X PUT --location-trusted -u user:passwd -H "txn_id:txnId" -H "txn_operation:abort" \ http://fe_host:http_port/api/{db}/_stream_load_2pc or curl -X PUT --location-trusted -u user:passwd -H "txn_id:txnId" -H "txn_operation:abort" \ http://be_host:webserver_port/api/{db}/_stream_load_2pc	2022-02-16 11:55:04 +08:00
Mingyu Chen	884fddbf33	[fix](compatibility) Fix compatibility issue of PRowBatch and some tablet sink bugs (#8000 ) 1. set both `tuple_offsets` and `new_tuple_offsets` in PRowBatch for compatibility 2. set FE config `repair_slow_replica` default to false Avoid impacting the load process after upgrading. Eg, if there are only 2 replicas, one is with high version count. After upgrade, that replica will be set to bad, so that the load process will be stopped because only 1 replica is alive. 3. Fix a bug that NodeChannel may be blocked at `close_wait()` Forget to set `add_batch_finish` flag after the last rpc finished. 4. Fix a NPE of RoutineLoadScheduler	2022-02-15 11:23:19 +08:00
yiguolei	aea3e4e59b	[refactor] Remove version hash from BE and related test in BE (#8027 )	2022-02-14 09:29:27 +08:00

1 2 3 4 5 ...

575 Commits