doris

Author	SHA1	Message	Date
Xinyi Zou	b1b99ae884	[Function] Support Decimal to calculate variance and standard deviation (#4959 )	2020-12-06 08:49:01 +08:00
sduzh	6fedf5881b	[CodeFormat] Clang-format cpp sources (#4965 ) Clang-format all c++ source files.	2020-11-28 18:36:49 +08:00
sduzh	10e1e29711	Remove header file common/names.h (#4945 )	2020-11-26 17:00:48 +08:00
caiconghui	fb7f4c8791	[Bug] fix bug that be thrift client cannot connect to fe thrift server when fe thrift server use `TThreadedSelectorServer` model (#4908 ) Fix bug that be thrift client cannot connect to fe thrift server when fe thrift server use TThreadedSelectorServer model	2020-11-22 20:40:33 +08:00
Mingyu Chen	f1b57c4418	[Optimize] Avoid repeated sending of common components in Fragments (#4904 ) This CL mainly changes: 1. Avoid repeated sending of common components in Fragments In the previous implementation, a query may generate multiple Fragments, these Fragments contain some common information, such as DescriptorTable. Fragment will be sent to BE in a certain order, so these public information will be sent repeatedly and generated repeatedly on the BE side. In some complex SQL, these public information may be very large, thereby increasing the execution time of Fragment. So I improved this. For multiple Fragments sent to the same BE, only the first Fragment will carry these public information, and it will be cached on the BE side, and subsequent Fragments no longer need to carry this information. In the local test, the execution time of some complex SQL can be reduced from 3 seconds to 1 second. 2. Add the time-consuming part of FE logic in Profile Including SQL analysis, planning, Fragment scheduling and sending on the FE side, and the time to fetch data.	2020-11-22 20:38:05 +08:00
Mingyu Chen	796f44beac	[Bug] Fix bug that routine load blocked with TOO_MANY_TASKS error (#4861 ) When receiving empty msg from kafka, the load process will quit abnormally. Fix #4860	2020-11-12 10:05:10 +08:00
Yingchun Lai	74bc25ffe5	[Metrics] Add metric to monitor timeout canceled fragment count (#4862 ) It would be helpful to monitor the count of timeout canceled fragments when there is any issuse cause fragments execute failed or queued too long time.	2020-11-11 21:21:48 +08:00
Mingyu Chen	bfdb15c730	[Bug] Fix some date functions to make their result same as MySQL (#4786 ) dayofweek, dayofmonth, dayofyear, weekofyear, timediff Also fix ut compilation problem	2020-10-27 12:52:44 +08:00
Yingchun Lai	6cbefd5621	[LRUCache] Expose LRU Cache status to metrics (#4688 ) Expose LRU Cache status to metrics would be helpful to diagnose problems like high usage, low hit rate.	2020-10-22 21:37:02 +08:00
Zhengguo Yang	09f97f8a05	[Refactor] Fixes some be typo part 2 (#4747 )	2020-10-20 09:28:57 +08:00
Yingchun Lai	3438a746ac	[Typo] Fix typo in metrics macros (#4739 ) Just fix typo. Rename DEFINE_GAUGE_METRIC_PROTOTYPE_5ARG(name, unit) to DEFINE_GAUGE_METRIC_PROTOTYPE_2ARG(name, unit) Rename DEFINE_GAUGE_METRIC_PROTOTYPE_2ARG(name, unit) witch define core metrics to DEFINE_GAUGE_CORE_METRIC_PROTOTYPE_2ARG(name, unit)	2020-10-15 19:56:43 +08:00
Mingyu Chen	f431d8d94c	[Enhance][Log] Make RPC error log more clear (#4702 ) At present, when some rpc errors occur, the client cannot obtain the error information well. And this CL change the RPC error returned to client like this: ``` ERROR 1064 (HY000): errCode = 2, detailMessage = there is no scanNode Backend. [10002: in black list(A error occurred: errorCode=2001 errorMessage:Channel inactive error!)] ERROR 1064 (HY000): failed to send brpc batch, error=The server is overcrowded, error_text=[E1011]The server is overcrowded @xx.xx.xx.xx:8060 [R1][E1011]The server is overcrowded @xx.xx.xx.xx:8060 [R2][E1011]The server is overcrowded @xx.xx.xx.xx:8060 [R3][E1011]The server is overcrowded @xx.xx.xx.xx:8060, client: yy.yy.yy.yy ```	2020-10-13 10:08:43 +08:00
Zhengguo Yang	75e0ba32a1	Fixes some be typo (#4714 )	2020-10-13 09:37:15 +08:00
HappenLee	04f26e4b7f	[SQL] Support Bucket Shuffle Join (#4677 ) Support Bucket Shuffle Join issue:#4394	2020-10-11 15:37:32 +08:00
ccoffline	f3cdf167d1	[Feature] Add time_round builtin functions (#4640 ) #4619 Add time_round functions that provides `time_floor` & `time_ceil` at each time unit. Fix two related bugs. - #4618 - Fix `struct TimeInterval` to use `int64_t` instead of `int32_t`, in case when the second diff overflow	2020-10-09 16:05:51 +08:00
HaiBo Li	5199a17a4b	[cache][be]Fix the bug of cross-border access cache (#4639 ) * When the different partition of the table is updated frequently, the partition key list of the cache is discontinuous, and the partition key in the request cannot hit the key list in the cache, resulting in the access overrun，the BE will crash. * Add some unit test case，add test cases that fail to hit the boundary value of cache	2020-09-28 13:35:52 +08:00
HappenLee	a61d0de173	[ODBC SCAN NODE] 4/4 Add ODBC_SCAN_NODE and Odbc_Scanner in BE and add ODBC_SCAN_NODE docs (#4438 )	2020-09-25 10:19:50 +08:00
HaiBo Li	5f43fb3bde	[Cache][BE] LRU cache for sql/partition cache #2581 (#4005 ) 1. Find the cache node by SQL Key, then find the corresponding partition data by Partition Key, and then decide whether to hit Cache by LastVersion and LastVersionTime 2. Refers to the classic cache algorithm LRU, which is the least recently used algorithm, using a three-layer data structure to achieve 3. The Cache elimination algorithm is implemented by ensuring the range of the partition as much as possible, to avoid the situation of partition discontinuity, which will reduce the hit rate of the Cache partition, 4. Use the two thresholds of maximum memory and elastic memory to control to avoid frequent elimination of data	2020-09-20 20:50:51 +08:00
qiye	065b979f35	[Bug] behavior of function str_to_date() and date_format() on BE and FE is inconsistent (#4612 ) 1. add date range check in `DateLiteral` for `FEFunctions` 2. `select str_to_date(202009,'%Y%m')` and `select str_to_date(str,'%Y%m') from tb where tb.str = '202009'` will return same output `2020-09-00`. 3. add support of zero-date to function `str_to_date()`,`date_format()` 4. fix FE can calculate negative value bug, eg: `select str_to_date('-2020', '%Y')` will return `NULL` instead of date value. current behavior is same as MySQL without sql_mode `NO_ZERO_IN_DATE` and `NO_ZERO_DATE`. current behavior ``` mysql> select siteid,str_to_date(siteid,'%Y%m%d') from table2 order by siteid; +------------+---------------------------------+ \| siteid \| str_to_date(`siteid`, '%Y%m%d') \| +------------+---------------------------------+ \| 1 \| 2001-00-00 \| \| 2 \| 2002-00-00 \| \| 2 \| 2002-00-00 \| \| 3 \| 2003-00-00 \| \| 4 \| 2004-00-00 \| \| 5 \| 2005-00-00 \| \| 20 \| 2020-00-00 \| \| 202 \| 0202-00-00 \| \| 2020 \| 2020-00-00 \| \| 20209 \| 2020-09-00 \| \| 202008 \| 2020-08-00 \| \| 202009 \| 2020-09-00 \| \| 2020009 \| 2020-00-09 \| \| 20200009 \| 2020-00-09 \| \| 20201309 \| NULL \| \| 2020090909 \| 2020-09-09 \| +------------+---------------------------------+ mysql> select str_to_date('2','%Y%m%d'),str_to_date('20','%Y%m%d'),str_to_date('202','%Y%m%d'),str_to_date('2020','%Y%m%d'),str_to_date('20209','%Y%m%d'),str_to_date('202009','%Y%m%d'),str_to_date('2020099','%Y%m%d'),str_to_date('20200909','%Y%m%d'),str_to_date('2020090909','%Y%m%d'),str_to_date('2020009','%Y%m%d'),str_to_date('20200009','%Y%m%d'),str_to_date('20201309','%Y%m%d'); +----------------------------+-----------------------------+------------------------------+-------------------------------+--------------------------------+---------------------------------+----------------------------------+-----------------------------------+-------------------------------------+----------------------------------+-----------------------------------+-----------------------------------+ \| str_to_date('2', '%Y%m%d') \| str_to_date('20', '%Y%m%d') \| str_to_date('202', '%Y%m%d') \| str_to_date('2020', '%Y%m%d') \| str_to_date('20209', '%Y%m%d') \| str_to_date('202009', '%Y%m%d') \| str_to_date('2020099', '%Y%m%d') \| str_to_date('20200909', '%Y%m%d') \| str_to_date('2020090909', '%Y%m%d') \| str_to_date('2020009', '%Y%m%d') \| str_to_date('20200009', '%Y%m%d') \| str_to_date('20201309', '%Y%m%d') \| +----------------------------+-----------------------------+------------------------------+-------------------------------+--------------------------------+---------------------------------+----------------------------------+-----------------------------------+-------------------------------------+----------------------------------+-----------------------------------+-----------------------------------+ \| 2002-00-00 \| 2020-00-00 \| 0202-00-00 \| 2020-00-00 \| 2020-09-00 \| 2020-09-00 \| 2020-09-09 \| 2020-09-09 \| 2020-09-09 \| 2020-00-09 \| 2020-00-09 \| NULL \| +----------------------------+-----------------------------+------------------------------+-------------------------------+--------------------------------+---------------------------------+----------------------------------+-----------------------------------+-------------------------------------+----------------------------------+-----------------------------------+-----------------------------------+ ```	2020-09-17 10:10:19 +08:00
HuangWei	704bcec9d3	[Bug] add_batch check state fix (#4575 )	2020-09-12 11:18:10 +08:00
Yingchun Lai	b780df697a	[refactor] Optimize threads usage mode in BE (#4440 ) BE can not graceful exit because some threads are running in endless loop. This patch do the following optimization: - Use the well encapsulated Thread and ThreadPool instead of std::thread and std::vector<std::thread> - Use CountDownLatch in thread's loop condition to avoid endless loop - Introduce a new class Daemon for daemon works, like tcmalloc_gc, memory_maintenance and calculate_metrics - Decouple statistics type TaskWorkerPool and StorageEngine notification by submit tasks to TaskWorkerPool's queue - Reorder objects' stop and deconstruct in main(), i.e. stop network services at first, then internal services - Use libevent in pthreads mode, by calling evthread_use_pthreads(), then EvHttpServer can exit gracefully in multi-threads - Call brpc::Server's Stop() and ClearServices() explicitly	2020-09-06 20:19:14 +08:00
Mingyu Chen	5166a6c6bc	[Bug] function str_to_date()'s behavior on BE and FE is inconsistent (#4495 ) Main CL: 1. Copy the code from BE to implement the `str_to_date()` function in FE. 2. `str_to_date("2020-08-08", "%Y-%m-%d %H:%i:%s")` will return `2020-08-08 00:00:00` instead of `2020-08-08`.	2020-09-03 17:16:19 +08:00
Yingchun Lai	498b06fbe2	[Metrics] Support tablet level metrics (#4428 ) Sometimes we want to detect the hotspot of a cluster, for example, hot scanned tablet, hot wrote tablet, but we have no insight about tablets in the cluster. This patch introduce tablet level metrics to help to achieve this object, now support 4 metrics on tablets: `query_scan_bytes `, `query_scan_rows `, `flush_bytes `, `flush_count `. However, one BE may holds hundreds of thousands of tablets, so I add a parameter for the metrics HTTP request, and not return tablet level metrics by default.	2020-09-02 10:39:41 +08:00
Mingyu Chen	a864db03fe	[Bug] Fix bug of load error hub and schema change (#4486 ) 1. When WITH_MYSQL is off, load error hub does not suport MySQL load error hub, we should check its return value. 2. misjudge the return value of `change_row_block` in schema_change.cpp	2020-08-31 23:21:50 +08:00
ZhangYu0123	004b955ca4	[Bug] Fix a null pointer bug in PlanFragmentExecutor. (#4473 ) Fix a null pointer bug in PlanFragmentExecutor. Add null check operation before it is used. Detail: #4472	2020-08-28 09:28:23 +08:00
HappenLee	e4e9af4577	This PR contain three things (#4448 ) 1. Fix core bug wild pointer in PlanFragmentExecutor, fix issue #4447 2. Fix core bug wild pointer json load, fix issue #4452 3. Change the declare order of ODBC type in thrift for compatibility	2020-08-26 10:53:53 +08:00
ZhangYu0123	97d963468a	[Code Cleanup] Template nest convert to c++11 syntax and style (#4442 )	2020-08-26 10:51:52 +08:00
HappenLee	5fc79561d7	[MemTracker][Bug-Fix] Fix core in DECHECK in memory tracker (#4421 ) Fix DECHECK failed in mem_tracker, issue #4420	2020-08-23 22:41:02 +08:00
Mingyu Chen	e25108097d	[Bug][MemTracker] Cleanup the mem tracker's constructor to avoid wrong usage (#4345 ) After PR: #4135, If a mem tracker has parent, it should be created by 'CreateTracker'. So I removed other unused constructors. And also fix the bug described in #4344	2020-08-18 16:54:55 +08:00
lichaoyong	c81862ebec	Remove palo::PInternalService_Stub in BE code. (#4298 ) We can remove the caller in sender side. After all node are upgraded, we can remove the callee in receiver side.	2020-08-10 09:46:17 +08:00
EmmyMiao87	4beed51366	[Comment]Fix spelling error (#4249 )	2020-08-09 00:04:43 +08:00
Yingchun Lai	e71152132c	[metrics] Redesign metrics to 3 layers (#4115 ) Redesign metrics to 3 layers: MetricRegistry - MetricEntity - Metrics MetricRegistry : the register center MetricEntity : the entity registered on MetricRegistry. Generally a MetricRegistry can be registered on several MetricEntities, each of MetricEntity is an independent entity, such as server, disk_devices, data_directories, thrift clients and servers, and so on. Metric : metrics of an entity. Such as fragment_requests_total on server entity, disk_bytes_read on a disk_device entity, thrift_opened_clients on a thrift_client entity. MetricPrototype: the type of a metric. MetricPrototype is a global variable, can be shared by the same metrics across different MetricEntities.	2020-08-08 11:23:01 +08:00
lichaoyong	b62ff8508f	Revert "[BUG] Using attachement strategy of brpc to send packet with big size. (#4237 )" (#4267 ) This reverts commit 120f30bcaec5ba8318ba1849b513b5d06d8df281.	2020-08-06 08:56:07 +00:00
lichaoyong	120f30bcae	[BUG] Using attachement strategy of brpc to send packet with big size. (#4237 ) Using attachement strategy of brpc to send packet with big size. BRPC send packet should serialize it first and then send it. If we send one batch with big size, it will encounter a connection failed. So we can use attachment strategy to bypass the problem and eliminate the serialization cost.	2020-08-05 10:29:41 +08:00
Mingyu Chen	3f31866169	[Bug][Load][Json] #4124 Load json format with stream load failed (#4217 ) Stream load should read all the data completely before parsing the json. And also add a new BE config streaming_load_max_batch_read_mb to limit the data size when loading json data. Fix the bug of loading empty json array [] Add doc to explain some certain case of loading json format data. Fix: #4124	2020-08-04 12:55:53 +08:00
HuangWei	10f822eb43	[MemTracker] make all MemTrackers shared (#4135 ) We make all MemTrackers shared, in order to show MemTracker real-time consumptions on the web. As follows: 1. nearly all MemTracker raw ptr -> shared_ptr 2. Use CreateTracker() to create new MemTracker(in order to add itself to its parent) 3. RowBatch & MemPool still use raw ptrs of MemTracker, it's easy to ensure RowBatch & MemPool destructor exec before MemTracker's destructor. So we don't change these code. 4. MemTracker can use RuntimeProfile's counter to calc consumption. So RuntimeProfile's counter need to be shared too. We add a shared counter pool to store the shared counter, don't change other counters of RuntimeProfile. Note that, this PR doesn't change the MemTracker tree structure. So there still have some orphan trackers, e.g. RowBlockV2's MemTracker. If you find some shared MemTrackers are little memory consumption & too time-consuming, you could make them be the orphan, then it's fine to use the raw ptr.	2020-07-31 21:57:21 +08:00
HappenLee	e6059341e8	[Spill To Disk][2/6] Add some runningtime function and change some function will be called in the future (#4152 )	2020-07-30 14:21:48 +08:00
HappenLee	594e53ec92	[Spill To Disk][1/6] The adjustment of the basic BufferedBlockMgr includes the following change (#4151 ) 1. Add Exec msg for BufferedBlockMgr for debug tuning 2. Change the API of Consume Memory? We will use it in HashTable in the future 3. Fix mistake of count _unfullfilled_reserved_buffers in BufferedBlockMgr	2020-07-30 10:29:12 +08:00
Zhengguo Yang	a2b53b8ddd	[Profile] Add transfer destinations detail to profile (#4161 ) Add transfer destinations detail to profile	2020-07-27 23:37:50 +08:00
HuangWei	a01d1aec56	[Compaction] track RowsetReader's mem & add metric (#4068 ) Ref https://github.com/apache/incubator-doris/issues/3624#issuecomment-655933244 Only RowsetReaders in compaction are under the track. Other RowsetReaders won't be effected, because the parent_tracker is nullptr.	2020-07-24 07:58:09 +08:00
HuangWei	9b0ad66b78	[runtime] Replace the thread pool in FragmentMgr (#4057 )	2020-07-15 10:03:48 +08:00
HappenLee	cd4fec8ab1	[Bug] Fix core of double delete, when RowBatch call transfer_resource_ownership (#4052 ) Resource release should be done by dest RowBatch. When we call method transfer_resource_ownership. if we don't clear the corresponding resources, which will cause the core problem of double delete.	2020-07-13 20:52:22 +08:00
wangbo	efef067f2d	[Bug] Fix mem_pool npe (#4045 ) Fix mem_pool NPE in column reader. Add a safe allocation method.	2020-07-09 21:50:22 +08:00
HappenLee	fafc7e406e	[Spill]Fix the problem of mem exec, when analytic eval node need to spill to disk with a low mem limit (#3991 ) [Bug] Fix the problem of mem exec, when analytic eval node need to spill to disk with a low mem limit. And clear_reservations of Analytic node reservation of block manager. [Running Profile] Add Spilled flag in Running Profile, when Analytic eval node and sort node spill to Disk.	2020-07-09 09:30:22 +08:00
caiconghui	5a27981e49	[Config] Add thrift_client_retry_interval_ms config in be for thrift client to avoid avalanche disaster in fe thrift server (#4022 ) This PR is mainly to add `thrift_client_retry_interval_ms` config in be for thrift client to avoid avalanche disaster in fe thrift server and fix some typo and some rpc setting problems at the same time.	2020-07-08 21:07:00 +08:00
Mingyu Chen	c3d9feed75	[Load][Json] Refactor json load logic to make it more reasonable (#4020 ) This CL mainly changes: 1. Reorganized the code logic to limit the supported json format to two, and the import behavior is more consistent. 2. Modified the statistical behavior of the number of error rows when loading in json format, so that the error rows can be counted correctly. 3. See `load-json-format.md` to get details of loading json format.	2020-07-07 23:07:28 +08:00
HuangWei	9bb7e5d208	Fix some code & comments (#3999 ) TPlanExecParams::volume_id is never used, so delete the print_volume_ids() function. Fix log, and log if PlanFragmentExecutor::open() returns error. Fix some comments	2020-07-03 21:18:47 +08:00
Dayue Gao	f9a52f5db4	[Bug] Insert may leak DeltaWriter when re-analyzed (#3973 )	2020-06-30 11:09:53 +08:00
caiconghui	48d947edf4	Support rpc_timeout property in stream load request to cancel request in fe in time when stream load request is timeout (#3948 ) This PR is to enable cancel stream load request in FE in time when stream load request is timeout to make stream load more robust.	2020-06-29 19:16:16 +08:00
Mingyu Chen	af1beb6ce4	[Enhance] Add prepare phase for some timestamp functions (#3947 ) Fix: #3946 CL: 1. Add prepare phase for `from_unixtime()`, `date_format()` and `convert_tz()` functions, to handle the format string once for all. 2. Find the cctz timezone when init `runtime state`, so that don't need to find timezone for each rows. 3. Add constant rewrite rule for `utc_timestamp()` 4. Add doc for `to_date()` 5. Comment out the `push_handler_test`, it can not run in DEBUG mode, will be fixed later. 6. Remove `timezone_db.h/cpp` and add `timezone_utils.h/cpp` The performance shows bellow: 11,000,000 rows SQL1: `select count(from_unixtime(k1)) from tbl1;` Before: 8.85s After: 2.85s SQL2: `select count(from_unixtime(k1, '%Y-%m-%d %H:%i:%s')) from tbl1 limit 1;` Before: 10.73s After: 4.85s The date string format seems still slow, we may need a further enhancement about it.	2020-06-29 19:15:09 +08:00

1 2 3 4 5 ...

287 Commits