doris

Author	SHA1	Message	Date
shengyunyao	4a17152f40	Add tdigest compression param for pencentile_approx function (#1939 )	2019-10-11 18:56:59 +08:00
EmmyMiao87	e267d031bb	Enhance the speed of avg function (#1889 ) This commit enable the avg operator in fe instead of converting the avg function into sum/count. Also, this commit fix the bug of deciamlv2 avg which cause the core in be. The int128 could not be assinged directly. The speed of avg function is similar to sum function after enhancement.	2019-10-10 22:43:46 +08:00
ZHAO Chun	8f016d3ab2	Make HLL be able to handle invalid data (#1908 ) In this change list 1. validate HLL column when loading data, if data is invalid, this row will be filtered. 2. seems as empty HLL when serializing invalid type of HLL data, with this change, all ingested data will be valid. 3. seems as empty HLL when deserializing nullptr or invalid type of HLL data. With this change, dirty data can be handled normally. 4. rename function empty_hll to hll_empty. 5. disable memtable_flush_execute_test because this will fails sometimes. When tearing down, some thread is not joined, and they will visit destroyed resource, which is invalid.	2019-09-29 10:55:23 +08:00
kangkaisen	b246d93128	Avoid SerDe for aggregation query with object pool (#1854 )	2019-09-26 13:51:13 +08:00
HangyuanLiu	40b9c3571b	Support hll_empty function (#1825 )	2019-09-25 09:28:02 +08:00
ZHAO Chun	c3fccb7a49	Support cast datetime to decimal (#1849 )	2019-09-23 19:56:20 +08:00
ZHAO Chun	93fe10a268	Reduce size of HyperLogLog struct (#1845 ) Now size of HyperLogLog struct is so large that it lead the rowset is too small when ingesting data. In this CL, registers in HyperLogLog are only created when it is needed. When ingesting data, it's normal case that there are only few values in one HyperLogLog.	2019-09-21 14:38:58 +08:00
Mingyu Chen	9aa2045987	Refactor alter job (#1695 )	2019-09-12 16:31:29 +08:00
kangkaisen	cd5cfea5cc	Encapsulate HLL logic (#1756 )	2019-09-09 15:52:10 +08:00
kangkaisen	3f22238012	Add check for to_bitmap function argument (#1747 )	2019-09-05 18:11:38 +08:00
Mingyu Chen	378ce8ca04	Use double when converting TIME type value (#1722 ) TIME type value is saved in DOUBLE, so using int64 can extend the time range.	2019-08-29 21:19:19 +08:00
ZHAO Chun	58801c6ab0	Support converting RowBatch and RowBlockV2 to/from Arrow (#1699 )	2019-08-27 11:30:00 +08:00
kangkaisen	1e4dd77d2a	Add bitmap agg type and udaf (#1610 )	2019-08-26 14:24:42 +08:00
HangyuanLiu	199ff968dc	Fix time zone compatibility (#1631 )	2019-08-13 18:44:35 +08:00
HangyuanLiu	69af50aa8c	Time zone related BE function (#1598 ) Details can be found in time-zone.md document	2019-08-12 20:57:59 +08:00
chenhao	2cb82c57bb	Fix bug that <=> operator and in operator get wrong result (#1516 ) * Fix bug that <=> operator and in operator get wrong result * Add some comment to get_result_for_null * Add an new Binary Operator to replace is_safe_for_null for handleing '<=>' operator * Add EQ_FOR_NULL to TExprOpcode * Remove macro definition last backslash	2019-07-30 11:17:53 +08:00
HangyuanLiu	4aedaea84e	Support TIME type and timediff function (#1505 )	2019-07-23 13:42:39 +08:00
lichaoyong	a9e8113b82	Fix heap-buffer-overflow in split_part() function in StringFunctions (#1482 )	2019-07-15 23:00:37 +08:00
HangyuanLiu	a7390c03f4	Add percentile_approx aggregate function (#1432 )	2019-07-11 16:44:43 +08:00
HangyuanLiu	941dec215b	Add utc_timestamp function (#1456 )	2019-07-11 11:09:08 +08:00
Candy	98bd4b4565	Add string function split_part (#1451 )	2019-07-10 09:47:33 +08:00
HangyuanLiu	adba5249c4	Add dayofweek function (#1376 )	2019-06-25 21:37:42 +08:00
ZHAO Chun	9d03ba236b	Uniform Status (#1317 )	2019-06-14 23:38:31 +08:00
Mingyu Chen	a04cf3a695	Fix bug that get_json_function may not be able to get result (#1295 ) std::string object is unexpectly deconstructed, cause invalid result. and Change log level	2019-06-13 12:58:47 +08:00
kangkaisen	ece34fb838	Make hll function backward compatibility (#1251 )	2019-06-05 11:12:36 +08:00
Mingyu Chen	722a9e71c7	Optimize json functions (#1177 ) 1. get_json_xxx() now support using quoto to escape dot 2. Implement json_path_prepare() function to preprocess json_path Performance of get_json_string() on 1000000 rows reduces from 2.27s to 0.27s	2019-05-21 09:13:12 +08:00
kangkaisen	b2a022b348	Add money_format function (#1064 )	2019-04-29 18:31:24 +08:00
lide	9c82d41981	Support Doris query ES by HTTP way (#925 )	2019-04-28 17:14:44 +08:00
HangyuanLiu	e5a5b6da16	Fix concat_ws return null when argument is null (#923 ) #918	2019-04-15 19:54:02 +08:00
zhaidongbo	a1bfc90320	Support hll_raw_agg in Aggregate Function (#832 ) hll_raw_agg Function aggregates the HLL type value, and return the HLL type value	2019-04-01 16:17:56 +08:00
lide	c34b306b4f	Decimal optimize branch #695 (#727 )	2019-03-22 17:22:16 +08:00
lide	297a5f27a5	Add comment to avoid modification for variable_length (#750 )	2019-03-14 12:28:43 +08:00
lide	cc2fd43c32	Rollback the fix of variable_length for Decimal (#744 )	2019-03-14 09:58:35 +08:00
lide	e2717e14ac	Fix the error of variable_length for Decimal (#724 )	2019-03-11 19:15:56 +08:00
chenhao	f7155217bf	Remove build rows counter in PartitionHashJoinNode (#557 ) * Remove build rows counter in PartitionHashJoinNode * Fix unit test fail in RuntimeProfileTest * Add check for result type length in cast_to_string_val	2019-01-21 14:08:59 +08:00
Salieri1969	4d5f92cce7	Add EsScanNode (#450 )	2019-01-17 17:59:33 +08:00
ZHAO Chun	7380483394	Support UDF (#468 ) Now, user can create UDF with CREATE FUNCTION statement. Doris only support UDF in this version, it will support UDAF/UDTF later.	2018-12-29 09:13:04 +08:00
ZHAO Chun	945aaf8923	Fix core when release UDF cache entry (#462 )	2018-12-24 14:34:31 +08:00
ZHAO Chun	90d71508ff	Add UserFunctionCache to cache UDF's library (#453 ) * Add UserFunctionCache to cache UDF's library This patch replace LibCache with UserFunctionCache. LibCache use HDFS URL to identify a UDF's Library, and when BE process restart all of downloaded library should be loaded another time. We use function id corresponding to a library, and when process restart, all downloaded libraries can be loaded without another downloading. * update	2018-12-21 22:07:21 +08:00
ZHAO Chun	e2bb86cf78	Add Md5Digest to util (#420 )	2018-12-12 20:06:35 +08:00
chenhao7253886	ae8d16c81e	Fix failed cases in regression test (#299 )	2018-11-12 11:15:39 +08:00
chenhao7253886	370e73ce5d	Fix truncation error in CastExpr (#283 )	2018-11-06 18:57:13 +08:00
kangpinghuang	d57e91db6e	Rewrite aes encryption (#264 ) Resolve #257	2018-11-02 15:26:31 +08:00
chenhao7253886	37b4cafe87	Change variable and namespace name in BE (#268 ) Change 'palo' to 'doris'	2018-11-02 10:22:32 +08:00
morningman	2868793b6b	Change license to Apache License 2.0 (#262 )	2018-11-01 09:06:01 +08:00
morningman	051aced48d	Missing many files in last commit In last commit, a lot of files has been missed	2018-10-31 16:19:21 +08:00
morningman	5d3fc80067	Added: * Add streaming load feature. You can execute 'help stream load;' to see more information. Changed: * Loading phase of a certain table can be parallelized, to reduce the load job execution time when multi load jobs to a single table. * Using RocksDB to save the header info of tablets in Backends, to reduce the IO operations and increate speeding of restarting. Fixed: * A lot of bugs fixed.	2018-10-31 14:46:22 +08:00
lide	bea10e4f06	1. hide password and other sensitive information in log and audit log 2. add 2 new proc '/current_queries' and '/current_backend_instances' to monitor the current running queries. 3. add a manual compaction api on Backend to trigger cumulative or base compaction manually. 4. add Frontend config 'max_bytes_per_broker_scanner' to limit to bytes per one broker scanner. This is to limit the memory cost of a single broker load job 5. add Frontend config 'max_unfinished_load_job' to limit load job number: if number of running load jobs exceed the limit, no more load job is allowed to be submmitted. 6. a log of bug fixed	2018-09-19 20:04:01 +08:00
morningman	15a3862d91	merge to fbc08a22bafa643aa9e0d7eb5cb90248ea47bbf8 (#226 ) removed unused logs when datetime format is error	2018-08-24 18:05:36 +08:00
morningman	cc74efb3c5	merge to ddb65b69f9c788e359e191889cb31f15279c41ec (#224 ) 1. Apache HDFS broker support HDFS HA and Hadoop kerberos authentication. 2. New Backup and Restore function. Use Fs Broker to backup your data to HDFS or restore them from HDFS. 3. Table-Level Privileges. Grant fine-grained privileges on table-level to specified user. 4. A lot of bugs fixed. 5. Performance improvement.	2018-08-24 17:12:26 +08:00

1 2

66 Commits