Loading data from HDFS into Hive moves the source directory into the table's location directory, leading to errors like `Can not get first file, please check uri` in the TVF test.
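For context, Hive's `LOAD DATA INPATH` moves (rather than copies) HDFS files into the table's location, so the original URI becomes invalid afterwards. A minimal sketch of the pattern that triggers this; the path and table name are hypothetical:
```sql
-- Hypothetical HDFS path and table name.
-- Hive MOVES the file into the table's location directory, so a later
-- TVF query that still points at the original URI cannot find the file.
LOAD DATA INPATH '/user/doris/tvf_data/all_types.avro'
INTO TABLE tvf_test_table;
```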
1. Collect external table row counts when executing `ANALYZE DATABASE`.
2. Support showing cached table stats (row count).
3. Support altering external table column stats.
4. Refresh/invalidate the table row count stat memory cache when an analyze task finishes and when table stats are dropped. (The statements involved are sketched below.)
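A rough sketch of the statements involved; the exact syntax and property keys may differ between Doris versions, and the catalog/db/table names are illustrative:
```sql
-- Collect stats (including row counts) for all tables in an external database.
ANALYZE DATABASE hive_ctl.db1;

-- Show the cached table stats (row count).
SHOW TABLE STATS hive_ctl.db1.tbl1;

-- Manually set column stats on an external table; the property keys are illustrative.
ALTER TABLE hive_ctl.db1.tbl1 MODIFY COLUMN col1
SET STATS ('row_count' = '10000', 'ndv' = '1000');

-- Dropping stats should also invalidate the cached row count.
DROP STATS hive_ctl.db1.tbl1;
```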
Currently, for a merge-on-write unique table, the delete bitmap of a rowset is calculated during the flush, commit, and publish phases. In this PR, we add a special mark to every rowset considered when calculating the delete bitmap in these three phases. Before finally merging the delete bitmap into the table meta's delete bitmap, we check whether all the rowsets contain the special mark, to verify that every rowset was considered during the three phases.
Because an executor cannot fail in the publish phase once the coordinator has received successful commit info from all executors, we only print logs if this correctness check fails rather than reporting a failure.
The avro-scanner jar package is reduced from 204 MB to 160 MB.
Hadoop-related dependencies in the original avro pom were packaged directly into the jar, resulting in a jar size of about 200 MB. Since the BE `lib` directory already provides a Hadoop jar environment, those dependencies can be referenced directly instead.
This PR fixes two issues:
1. When using the S3 TVF to query files in AVRO format, a change of `TFileType` caused the originally queried `FILE_S3` to become `FILE_LOCAL`, making the query fail (a sample query is sketched below).
2. The parameters `s3.virtual.key` and `s3.virtual.bucket` are both removed; a new `S3Utils` class in jni-avro parses the bucket and key from the S3 URI.
The main purpose of this change is to unify the S3 parameters.
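For reference, a query of the shape affected by issue 1; the endpoint, path, and credentials are placeholders, and the parameter names may vary by version:
```sql
-- Placeholder endpoint, path, and credentials.
SELECT *
FROM s3(
    'uri' = 'https://my-bucket.s3.example.com/path/data.avro',
    'format' = 'avro',
    's3.access_key' = 'ak',
    's3.secret_key' = 'sk'
)
LIMIT 10;
```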
When the data stream sender is doing a broadcast shuffle, it accumulates blocks up to the batch size and then sends them to the destinations; but for local receivers, it ONLY sends the current block, which causes data loss.
This issue was introduced by #22218.
If #22218 is picked to the 2.0 branch, this PR also needs to be picked.
Assume there is a hive catalog named hive_ctl, a hive db named db1, and a table named tbl1. If we connect to a slave FE and execute the following commands:
1. `switch hive_ctl`
2. `show partitions from db1.tbl1`
we will get an error like this:
```
MySQL [(none)]> show partitions from db1.tbl1;
ERROR 1049 (42000): errCode = 2, detailMessage = Unknown database 'default_cluster:db1'
```
The reason is that the slave FE forwards the `ShowPartitionStmt` to the master FE but does not sync the current catalog information, so the parser cannot find the db and throws this exception. This is just one case; some other similar cases fail too.
The FE log grows large on a busy Doris cluster, and preserving historical logs costs a lot of disk space. Enabling compression is a good way to save space, and a gzip-compressed text file can still be viewed without decompressing it first.
Upgrade guava to 32.1.2-jre
Set ck dependency scope to provided
Upgrade okio to 3.4.0
Upgrade snake yaml to 1.33
Upgrade aws-java-sdk to 1.12.519
Upgrade hadoop to 3.3.6
1. If derived from an origin column, e.g. `create table tbl1 as select col1 from tbl2`, the length will be the same as the origin column's.
2. If derived from a function, e.g. `create table tbl1 as select func(col1) from tbl2`, the length will be 65533.
3. If derived from a constant value, e.g. `create table tbl1 as select "abc" from tbl2`, the length will be 65533. (The three cases are sketched below.)
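A small sketch of the three cases, assuming `tbl2.col1` is declared as `varchar(10)`; the table names are illustrative:
```sql
-- Assume: create table tbl2 (col1 varchar(10), ...);

-- Case 1: derived from an origin column -> col1 is varchar(10), same as tbl2.col1.
create table t1 as select col1 from tbl2;

-- Case 2: derived from a function -> the result column is varchar(65533).
create table t2 as select upper(col1) from tbl2;

-- Case 3: derived from a constant value -> the result column is varchar(65533).
create table t3 as select "abc" from tbl2;
```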
* Refactor thirdparty.cmake by adding an add_thirdparty function
Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>
* Refactor thirdparty.cmake
- Rename add_library NOADD to NOTADD
- Fix tcmalloc not being added to COMMON_THIRDPARTY
- Fix libintl in OS_MACOSX
Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>