doris

Author	SHA1	Message	Date
Gabriel	2068bf2dea	[Refactor](predicate) Use primitive type as template argument for predicate (#11647 )	2022-08-11 12:06:44 +08:00
Mingyu Chen	56ce8c0c5c	[feature](http) add api for showing current queries and kill query (#11657 )	2022-08-11 10:32:46 +08:00
toms	27f652aaff	[extension](feature)Mysql database import doris by external tables (#10905 )	2022-08-11 10:18:45 +08:00
Ashin Gau	8f5aed27ec	[feature-wip](parquet-reader)read and decode parquet physical type (#11637 ) # Proposed changes Read and decode parquet physical types. 1. Booleans are encoded with bit-packing; this PR introduces the bit-packing implementation from Impala. 2. Create a parquet file including all the primitive types supported by Hive. ## Remaining Problems 1. At present, only physical types are decoded; there is no mapping or conversion to Doris logical types. 2. Decimal / Timestamp / Date types are not parsed or processed. 3. Int_8 / Int_16 are stored as Int_32; how to resolve these types is still open.	2022-08-11 10:17:32 +08:00
zhangjingcun	7fb0913d35	Usage example of modifying the "storage_root_path" parameter (#11659 )	2022-08-11 09:57:50 +08:00
jiafeng.zhang	200b558156	[typo](doc)spark load uses kerberos authentication method (#11662 )	2022-08-11 09:57:26 +08:00
huangzhaowei	04d26ddf22	[feature-wip](multi-catalog)Support use catalog.db and show databases from catalog stmt (#11338 )	2022-08-11 09:50:32 +08:00
Stalary	5d99abb3ec	MOD: label with cte doc (#11661 ) insert label with cte doc	2022-08-11 09:45:59 +08:00
minghong	02a3f21b65	[fix](analyzer) InferFilterRule bug: equations in on clause of outer/anti join are not inferable. (#11515 )	2022-08-11 09:36:43 +08:00
Stalary	d60afe82a4	FIX: doc error (#11660 ) es doc fix	2022-08-11 09:20:06 +08:00
chenlinzhong	5d02f70a0f	[remote-udaf](python-samples) use python to impl remote avg and sum s… (#11655 )	2022-08-10 22:13:37 +08:00
Gabriel	a3714981fd	[Bug](schema change) Fix bug for vectorized schema change (#11652 )	2022-08-10 21:42:51 +08:00
Jibing-Li	6c9d158e9b	[fix](script) Fix hdfs-site.xml file name typo. #11653	2022-08-10 21:42:08 +08:00
Stalary	d8427037be	[Bug](doe) Fix some bug (#11594 )	2022-08-10 21:00:05 +08:00
Paul Lin	290f0400d3	[docs] Fix Chinese description in En docs (#11512 )	2022-08-10 20:28:05 +08:00
zhannngchen	70b39475cf	[fix](scanner) delete predicates might be inconsistent with rowset readers (#11598 )	2022-08-10 19:40:54 +08:00
jakevin	976e7685db	[minor](*): remove redundant log and unused code. (#11620 )	2022-08-10 19:28:04 +08:00
Liqf	adf61a77bd	[docs](query profile)add show query profile (#11635 )	2022-08-10 19:26:58 +08:00
xueweizhang	8c344d33e6	[Enhancement](meta) sort result by tablename when show tables like show data (#11638 ) * [improvement] sort result by tablename when show tables like 'show data'	2022-08-10 19:26:30 +08:00
Jerry Hu	a153af9698	[chore](regression-test) Add drop table in aggregate_count1 (#11632 )	2022-08-10 19:25:43 +08:00
Jerry Hu	c8418d13b5	[improvement](config)Use session variable to replace configuration for 'enable_function_pushdown' (#11641 )	2022-08-10 19:25:02 +08:00
Dongyang Li	c6f520fab4	[thirdparty](brpc) fix _dl_sym undefined reference on Ubuntu22.04 (#11643 ) Co-authored-by: qcloud <ubuntu@localhost.localdomain>	2022-08-10 19:23:10 +08:00
jakevin	89d3809a0e	[feature](Nereids): Enable the costAndEnforcerJob (#11604 ) 1. Enable the costAndEnforcerJob 2. Fix some bug of enforcer. 3. polish property name and method	2022-08-10 15:17:15 +08:00
Jerry Hu	0291f84a9e	[fix](like-predicate) Add missing functions in LikeColumnPredicate (#11631 )	2022-08-10 15:03:14 +08:00
caiconghui	71d9b383d4	[Enhancement](hdfs) Support loading hdfs config for be from hdfs-site.xml (#11634 ) Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2022-08-10 14:49:50 +08:00
zxealous	a478e1d669	[doc][fix] fix the duplicate partition name in example of CREATE TABLE (#11188 )	2022-08-10 10:26:12 +08:00
wangyongfeng	b3d476eebb	[fix](ui)source map files not included in production builds (#11612 ) Co-authored-by: wangyf0555 <wangyongfeng@flywheels.com>	2022-08-10 08:19:07 +08:00
Henry2SS	ae90d45594	[Bug](show data skew)fix show data skew logic (#11616 ) Co-authored-by: wuhangze <wuhangze@jd.com>	2022-08-10 08:18:39 +08:00
Xin Liao	aaaf6915e4	[feature-wip](unique-key-merge-on-write) fix rowid conversion ut that may create a directory under an incorrect path (#11628 )	2022-08-10 08:17:47 +08:00
starocean999	601f28dd90	[fix](regexpr)regexpr functions' contexts should be THREAD_LOCAL (#11595 )	2022-08-10 06:58:24 +08:00
camby	01e4522612	[fix]collect_list/collect_set without GROUP BY for NOT NULL column (#11529 ) Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-08-09 20:49:37 +08:00
carlvinhust2012	df47b6941d	[feature-wip](array-type) support the array type in reverse function (#11213 ) Co-authored-by: hucheng01 <hucheng01@baidu.com>	2022-08-09 20:49:09 +08:00
Tiewei Fang	169996d8e4	[feature](information_schema) add `rowsets` table into information_s… (#11266 ) * [feature](information_schema) add 'segments' table into information_schema	2022-08-09 18:15:54 +08:00
Shuo Wang	7b67661262	add plan checker (#11619 ) This PR proposes to add a plan checker to facilitate plan checking in unit tests. Usage of plan checker is like below: ```java new PlanChecker() .plan(myPlan) .applyBottomUp(myRule) .matches(expectedPattern); ```	2022-08-09 17:19:30 +08:00
HB	583b44dfa8	[enhancement](broker) Improve the availability of broker load (#10699 )	2022-08-09 17:00:48 +08:00
Mingyu Chen	cc6c92935a	[minor](log) add a warn log to observe invalid query profile (#11588 ) I tried to fix the bug in #10095. The error occurred when I first created an empty table and queried it, but I could not reproduce it again, so I added a warn log here to observe it.	2022-08-09 14:10:03 +08:00
Mingyu Chen	2cadf85988	[improvement](alter) modify table's default replica if table is unpartitioned (#11550 ) Before, if a table was unpartitioned, executing the following alter stmt: ``` alter table tbl1 set ("replication_num" = "1"); ``` changed only the replication_num of the tbl1 partition (an unpartitioned table also has a single partition with the same name as the table), while the table's default replication_num was left unchanged. So `show create table tbl1` still showed the original replication_num. This CL mainly changes: 1. For an unpartitioned table, if the user changes its replication_num, both the table's and the partition's replication_num are changed.	2022-08-09 14:09:38 +08:00
Liqf	85e67b04e2	fix-doc3 (#11587 ) bloomFilter fix-doc	2022-08-09 13:35:32 +08:00
jiafeng.zhang	fcf767b2e4	[fix](doc)Modify the installation document, the description of disk space limit (#11609 )	2022-08-09 13:33:32 +08:00
wudi	a4f9628576	[improvement](datax) improvement json import and support csv writing 1. At present, read_json_by_line and fuzzy_parse are used for json format writing, which lowers streamload write performance. Writing now uses strip_outer_array and fuzzy_parse, which is about 3 times faster. 2. Add csv writing; the column separator is set to \x01 and the row separator to \x02, and performance is about 5 times higher than before.	2022-08-09 11:50:24 +08:00
ElvinWei	436ee0dd1d	[feature-wip](statistics) step4.1: manually inject statistics for a table or column (#11030 ) This PR mainly supplements the syntax of the previous PR (#8861); it lets users manually inject statistics, including table, partition, and column statistics. table/partition stats type: - row_count - data_size column stats type: - ndv - avg_size - max_size - num_nulls - min_value - max_value Modify table or partition statistics: ``` ALTER TABLE table_name SET STATS ('k1' = 'v1', ...) [PARTITIONS(p_name1, p_name2...)] ``` Modify column statistics: ``` ALTER TABLE table_name MODIFY COLUMN columnName SET STATS ('k1' = 'v1', ...) [PARTITIONS(p_name1, p_name2...)] ``` Some notes: - Statistics can only be injected into olap type tables. - Statistics injected into temporary partitions are not supported. - When injecting statistics into a partitioned table, users need to specify a partition name. - If multiple partitions are specified, the same stats will be injected on multiple partitions. - The current code also has mock statistics @zhengshij	2022-08-09 11:24:23 +08:00
luozenglin	970a35d658	[fix](docs) Fix some errors related to privilege and grant in the docs (#11377 )	2022-08-09 11:02:47 +08:00
Stalary	2b918eaccd	[fix](Doris On ES) Fix es not support aliases error (#11547 ) 1. Fix es not support aliases error 2. Fix multicatalog query es error 3. add ut	2022-08-09 09:36:05 +08:00
Kang	f9b151744d	optimize topn query if order by columns is prefix of sort keys of table (#10694 ) * [feature](planner): push limit to olapscan when meet sort. * if olap_scan_node's sort_info is set, push sort_limit, read_orderby_key and read_orderby_key_reverse for olap scanner * There is a common query pattern to find the latest time series data, e.g. SELECT * from t_log WHERE t>t1 AND t<t2 ORDER BY t DESC LIMIT 100. If the ORDER BY columns are a prefix of the table's sort key, the query can be greatly optimized to read far less data instead of reading all data between t1 and t2. Because the ORDER BY columns and the table's sort key share the same order, it is enough to read LIMIT N rows from each related segment and merge them. 1. set read_orderby_key to true for read_params and _reader_context if olap_scan_node's sort info is set. 2. set read_orderby_key_reverse to true for read_params and _reader_context if is_asc_order is false. 3. rowset reader force merge read segments if read_orderby_key is true. 4. block reader and tablet reader force merge read rowsets if read_orderby_key is true. 5. for ORDER BY DESC, read and compare in reverse order 5.1 segment iterator reads backward using a new BackwardBitmapRangeIterator and reverses the result block before returning it to the caller. 5.2 VCollectIterator::LevelIteratorComparator and VMergeIteratorContext return the opposite result for _is_reverse order in their compare functions. Co-authored-by: jackwener <jakevingoo@gmail.com>	2022-08-09 09:08:44 +08:00
pengxiangyu	b44c47fc10	[fix] (remote storage) fix bug for storage policy (#11597 )	2022-08-09 09:05:48 +08:00
Kikyou1997	b9f7f63c81	[Fix](planner) Fix wrong planner with count(*) optimizer for cross join optimization (#11569 )	2022-08-09 09:01:25 +08:00
morrySnow	7c950c7cd5	[feature](Nereids) support cross join in Nereids (#11502 ) 1. add PhysicalNestedLoopJoin 2. Translate PhysicalNestedLoopJoin to CrossJoinNode in PhysicalPlanTranslator	2022-08-08 22:14:27 +08:00
morrySnow	1701ffa7c0	[fix](planner)push constant expr in predicate to outer join's other conjuncts by mistake (#11527 ) constant expr in predicate should not be pushed to outer join's other conjuncts	2022-08-08 20:56:08 +08:00
jakevin	4f60b37402	[feature](Nereids):refactor and add outer join LAsscom. (#11531 ) refactor and add outer join LAsscom. Extract the common function to LAsscomHelper.	2022-08-08 20:08:12 +08:00
Fy	647b6e843a	[feature](nereids)add InPredicate in expressions (#11264 ) 1. Add InPredicate expression parser and translator 2. Add regression-test for In predicate (in nereids_syntax) 3. Support NOT EqualTo and NOT InPredicate in ExpressionTranslator#visitNot()	2022-08-08 19:59:54 +08:00


6608 Commits
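
The top-N optimization described in commit f9b151744d (#10694) can be illustrated with a small sketch. This is not the Doris implementation: it is a Python model that assumes each segment is an in-memory list already sorted ascending by the ORDER BY key, and the function name `top_n_desc` and its parameters are hypothetical. It shows only the core idea from the commit message: take at most LIMIT N rows from the tail of each segment's matching range, then merge the per-segment results.

```python
import heapq
from bisect import bisect_left, bisect_right
from itertools import islice


def top_n_desc(segments, lo, hi, n):
    """Return the n largest keys k with lo < k < hi, largest first.

    Hypothetical model: each segment is a list sorted ascending by key,
    standing in for a segment whose sort key matches the ORDER BY column.
    """
    per_segment = []
    for seg in segments:
        start = bisect_right(seg, lo)  # index of first key > lo
        end = bisect_left(seg, hi)     # index one past last key < hi
        # Keep only the last n keys of the matching range, newest first.
        tail = seg[max(start, end - n):end]
        per_segment.append(tail[::-1])
    # Every per-segment list is descending, so a k-way merge is enough.
    merged = heapq.merge(*per_segment, reverse=True)
    return list(islice(merged, n))


segments = [[1, 4, 7, 10], [2, 5, 8, 11], [3, 6, 9, 12]]
print(top_n_desc(segments, lo=2, hi=11, n=4))  # [10, 9, 8, 7]
```

Each segment contributes at most n rows, so the merge touches at most n rows per segment rather than every row between the two bounds.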