doris

Author	SHA1	Message	Date
Jibing-Li	9c6c2f736e	[Improvement](statistics)Improve stats sample strategy (#26435 ) Improve the accuracy of sample stats collection. For non distribution columns, use `nd / (n - f1 + f1n/N)` where `f1` is the number of distinct values that occurred exactly once in our sample of n rows (from a total of N), and `d` is the total number of distinct values in the sample. For distribution columns, use `ndv(n) * fraction of tablets sampled` for NDV. For very large tablet to sample, use limit to control the total lines to scan (for non key column only, because key column is sorted and will be inaccurate using limit).	2023-11-13 15:52:21 +08:00
Mingyu Chen	4230b8c36c	[doc](hive) fix hive.version doc (#26806 )	2023-11-12 19:38:12 +08:00
yujun	0749d632c4	[feature](diagnose) diagnose for cluster balance (#26085 )	2023-11-10 15:31:58 +08:00
Xinyi Zou	019fb956d3	[docs](cache) Refactor query-cache docs (#26418 )	2023-11-10 13:57:20 +08:00
zhiqiang	a5565f68b2	[Refactor](opentelemetry) Remove opentelemetry (#26605 )	2023-11-09 18:05:34 +08:00
zhengyu	33e46ee13d	[enhancement](config) enable single_replica_load by default in BE (#26619 ) Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>	2023-11-09 12:14:37 +08:00
Tiewei Fang	f6b7046a6e	[fix](regression-test) add tests for jdbc catalog (#26608 )	2023-11-09 11:59:35 +08:00
Tiewei Fang	7df60a4980	[Refactor](Tvf) delete some unused code of tvf and add doc for `queries` tvf (#26460 ) 1. delete some unused code of tvf 2. add doc for `queries` tvf: #25051	2023-11-09 09:06:09 +08:00
zy-kkk	5bcf6bfd46	[fix](jdbc catalog) fix mysql zero date (#26569 )	2023-11-08 21:41:56 +08:00
xy	be7d49cb9f	[Fix](doc) Fixed some errors in the documentation (#26410 ) Co-authored-by: xingying01 <xingying01@corp.netease.com>	2023-11-08 15:19:34 +08:00
zclllyybb	47ba4aaf30	[Enhancement](load) add timer and partitions number limit (#26549 ) add timer and partitions number limit	2023-11-08 11:22:40 +08:00
AKIRA	290070074a	[refactor](stats) refactor collection logic and opt some config (#26163 ) 1. not collect partition stats anymore 2. merge insert of stats 3. delete period collector since it is useless 4. remove enable_auto_sample 5. move some config related to stats to global session variable Before this PR, when analyze a table, the insert count equals column count times 2 After this PR, insert count of analyze table would reduce to column count / insert_merge_item_count. According to my test, when analyzing tpch lineitem, the insert sql count is 1	2023-11-08 11:03:44 +08:00
Yulei-Yang	daea751a98	[Improvement](auditlog) add column catalog for audit log and audit log table (#26403 )	2023-11-08 10:25:15 +08:00
Mingyu Chen	efd1aa3016	[Revert](code-style) revert FE code-format #25033 and #26488 (#26505 )	2023-11-07 16:37:24 +08:00
Petrichor	84b90abeeb	[typo](docs)Improve stream load document (#25893 )	2023-11-06 23:23:45 +08:00
yujun	b99afcc7b5	[docs](x-load) Add table property min_load_replica_num doc (#26333 )	2023-11-06 23:05:35 +08:00
zclllyybb	1a83a39aec	Revert "[fix](auto-partition) Fix auto partition concurrent conflict (#26166 )" (#26448 ) This reverts commit f22611769944e78c28f1b0a1eeb7b7414a16e8db.	2023-11-06 16:39:19 +08:00
Guangdong Liu	d088cba2b1	[feature](code-style)add spotless plugin (#25033 )	2023-11-06 14:01:39 +08:00
zclllyybb	f226117699	[fix](auto-partition) Fix auto partition concurrent conflict (#26166 )	2023-11-06 10:34:26 +08:00
Xinyi Zou	54548728ec	1 (#26363 )	2023-11-03 15:41:31 +08:00
xy720	a5ef90dacc	[enhancement](recover) support skipping missing version in select by session variable (#25654 )	2023-11-02 20:01:51 +08:00
wudi	b94e6d6c05	[doc](fix) update doc for rename column (#25832 )	2023-11-01 11:30:03 +08:00
DongLiang-0	fef520c617	[regression](catalog)Add test case of paimon complex type (#25834 ) Add Paimon complex nested type regression case. Related pr:#25364	2023-11-01 09:59:55 +08:00
Zhiyu Hu	4eb9a52ace	[regression](s3load) Add regression testing and modify description text for s3load (#25947 )	2023-11-01 07:39:16 +08:00
Pxl	696ecc8c83	[Chore](log) adjust error code on too many filtered rows (#26168 )	2023-11-01 00:15:56 +08:00
Tiewei Fang	3eac2a75cb	[fix](docs) modify docs abort outfile and export (#26033 )	2023-10-31 17:25:44 +08:00
slothever	994db8b400	[fix](multi-catalog)add the FAQ for Aliyun DLF and add the fs.xx.impl check (#25594 ) 1. add the FAQ for Aliyun DLF，include the DLF configuration and Jindo SDK locations 2. add the fs.xx.impl check when load cache files	2023-10-31 17:09:52 +08:00
zhiqiang	0449a240f4	[Fix](from_unixtime) Keep consistent with MySQL & bug fix (#25966 ) Bug fix: implicit convert from int32 -> int64 makes negative time stamp valid, so change signature to int64 Consistent: keep consistent with mysql.	2023-10-31 14:31:24 +08:00
Guangming Lu	c32cd5b25a	FE ssl certificates are of various formats #26039 #26044	2023-10-28 22:37:38 +08:00
walter	365fdd2f4d	[feature](backup) add property to remove snapshot before creating repo (#25847 ) Doris is not responsible for managing snapshots, but it needs to clear all snapshots before doing backup/restore regression testing, so a property is added to indicate that existing snapshots need to be cleared when creating a repo. In addition, a regression test case for backup/restore has been added.	2023-10-27 21:03:26 +08:00
Ashin Gau	cad3d8fb99	[doc](complex type) add docs for complex and nested types (#25865 ) Add docs for complex and nested types.	2023-10-27 15:04:53 +08:00
zhannngchen	642c701819	[doc](partial update) add flink-connector docs (#25843 )	2023-10-26 16:53:06 +08:00
Guangdong Liu	3e21e4bdc2	[feature](CANCEL-ALTER-SYSTEM)decommission backend by ids (#25441 ) Issue Number: close #23636	2023-10-25 19:49:38 +08:00
AKIRA	4bda1650e1	[docs & fix](stats) Fix tablesample init failed and some outdated contents in docs (#25603 )	2023-10-25 04:38:00 -05:00
Guangdong Liu	4f17c2a8b1	[feature](alter backends)backend modify & drop & decommission by ids (#25444 )	2023-10-25 14:32:30 +08:00
zhangdong	4cd0dae4b3	[improvement](auth) support show all properties (#25645 ) support `show all properties`	2023-10-24 17:27:59 +08:00
Siyang Tang	51bc49a727	[docs](truncate-table) load may fail when truncating table (#25793 )	2023-10-24 14:10:26 +08:00
DongLiang-0	f23fdcbbef	[typo](doc)Adjust tablet_rowset_stale_sweep_time_sec parameter default value to 300 (#25584 )	2023-10-24 10:32:08 +08:00
bobhan1	28c799ce35	[Docs](partial update) Fix a typo in docs in partial update for insert statement (#25776 )	2023-10-23 17:54:47 +08:00
川流	fbc448520a	[feature](ColdHeatSeperation) Support to upload cold data to HDFS (#22048 )	2023-10-22 21:04:43 +08:00
caoliang-web	68d3c25f26	[typo](doc)Modify the default value of Stale rowset cleanup policy(#25517 )	2023-10-20 15:03:49 +08:00
gnehil	32fe78511a	[typo](doc) update spark connector two phase commit option doc (#24458 )	2023-10-20 10:22:05 +08:00
Siyang Tang	2353582493	[enhancement](load) support for broker load, routine load, mysql load and add docs (#25528 ) cases will be added later.	2023-10-19 15:43:22 +08:00
caoliang-web	4752b800b2	[typo](doc)update config (#25425 )	2023-10-19 10:02:31 +08:00
bobhan1	2a442972a8	[Fix](merge-on-write) Fix some bugs about sequence column (#24915 ) 1. add checks and handling of sequence column in #21896 to insert statement in origin planner and Nereids planner. 2. disable drop sequence mapping column in schema change.	2023-10-18 20:40:12 +08:00
Siyang Tang	0ec537edef	[fix](column-id) fix null conn ctx in column id flusher and parser for `database` field in corresponding show stmt (#25393 )	2023-10-18 14:11:31 +08:00
Jibing-Li	1130317b91	[Improvement](statistics)Collect stats for hive partition column using metadata (#24853 ) Hive partition columns' stats could be calculated from hive metastore data. Doesn't need to execute sql to get the stats. This PR is using hive partition metadata to collect partition column stats.	2023-10-17 10:31:57 +08:00
Tiewei Fang	85b8497624	[fix](Tvf) return empty set when tvf queries an empty file or an error uri (#25280 ) ### Before: return errors when tvf queries an empty file or an error uri: 1. get parsed schema failed, empty csv file 2. Can not get first file, please check uri. ### Now: we just return empty set when tvf queries an empty file or an error uri. ```sql mysql> select * from s3( "uri" = "https://error_uri/exp_1.csv", "s3.access_key"= "xx", "s3.secret_key" = "yy", "format" = "csv") limit 10; Empty set (1.29 sec) ```	2023-10-17 09:52:53 +08:00
Hu Yanjun	fe1980d7f2	[docs](docs) Add release note 2.0.2 (#25375 )	2023-10-16 20:38:45 +08:00
yagagagaga	4c42f3b783	[Improvement](hive-udf)(doc) minimize hive-udf and add some docs. (#24786 )	2023-10-16 16:47:21 +08:00

1 2 3 4 5 ...

2238 Commits