Commit Graph

1980 Commits

0086fdbbdb [enhancement](planner) support delete from using syntax (#17787)
Support the DELETE FROM ... USING syntax; it is only supported on the UNIQUE KEY model.

Use the result of `t2` joined with `t3` to remove rows from `t1`:

```sql
-- create t1, t2, t3 tables
CREATE TABLE t1
  (id INT, c1 BIGINT, c2 STRING, c3 DOUBLE, c4 DATE)
UNIQUE KEY (id)
DISTRIBUTED BY HASH (id)
PROPERTIES('replication_num'='1', "function_column.sequence_col" = "c4");

CREATE TABLE t2
  (id INT, c1 BIGINT, c2 STRING, c3 DOUBLE, c4 DATE)
DISTRIBUTED BY HASH (id)
PROPERTIES('replication_num'='1');

CREATE TABLE t3
  (id INT)
DISTRIBUTED BY HASH (id)
PROPERTIES('replication_num'='1');

-- insert data
INSERT INTO t1 VALUES
  (1, 1, '1', 1.0, '2000-01-01'),
  (2, 2, '2', 2.0, '2000-01-02'),
  (3, 3, '3', 3.0, '2000-01-03');

INSERT INTO t2 VALUES
  (1, 10, '10', 10.0, '2000-01-10'),
  (2, 20, '20', 20.0, '2000-01-20'),
  (3, 30, '30', 30.0, '2000-01-30'),
  (4, 4, '4', 4.0, '2000-01-04'),
  (5, 5, '5', 5.0, '2000-01-05');

INSERT INTO t3 VALUES
  (1),
  (4),
  (5);

-- remove rows from t1
DELETE FROM t1
  USING t2 INNER JOIN t3 ON t2.id = t3.id
  WHERE t1.id = t2.id;
```

The expected result is that only the row where id = 1 is removed from table t1:

```
+----+----+----+--------+------------+
| id | c1 | c2 | c3     | c4         |
+----+----+----+--------+------------+
| 2  | 2  | 2  |    2.0 | 2000-01-02 |
| 3  | 3  | 3  |    3.0 | 2000-01-03 |
+----+----+----+--------+------------+
```
2023-03-16 13:12:00 +08:00
12d9d19366 [docs](Nereids) add nereids zh-CN docs (#16743) 2023-03-16 11:52:30 +08:00
e4a1e57d6f [feature](multi-catalog) support sap hana jdbc catalog and jdbc external table (#17780) 2023-03-15 20:37:36 +08:00
85080ee3c3 [vectorized](function) support array_map function (#17581) 2023-03-15 10:51:29 +08:00
ca0367d846 FIX: es doc (#17771) 2023-03-15 10:40:53 +08:00
76f486980a [docs](user)update the users number (#17749) 2023-03-14 22:42:51 +08:00
ff9e03e2bf [Feature](add bitmap udaf) add the bitmap intersection and difference set for mixed calculation of udaf (#15588)
* Add the bitmap intersection and difference set for mixed calculation of udaf
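The PR body does not name the new functions; as a hedged sketch, this kind of mixed intersection/difference calculation is expressed with Doris's orthogonal bitmap family, assuming the `orthogonal_bitmap_expr_calculate_count` signature from the Doris docs (table, columns, and tag values below are hypothetical):

```sql
-- Count distinct users whose tags satisfy the mixed expression:
-- (tag 100 OR tag 200) AND NOT tag 300
SELECT orthogonal_bitmap_expr_calculate_count(user_id, tag, '(100|200)-300')
FROM user_tag_bitmap;
```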

Co-authored-by: zhangbinbin05 <zhangbinbin05@baidu.com>
2023-03-14 20:40:37 +08:00
699159698e [enhancement](planner) support update from syntax (#17639)
Support the UPDATE ... FROM syntax.

Note: enable_concurrent_update is not supported yet.

```
UPDATE <target_table>
  SET <col_name> = <value> [ , <col_name> = <value> , ... ]
  [ FROM <additional_tables> ]
  [ WHERE <condition> ]
```

for example:
t1
```
+----+----+----+-----+------------+
| id | c1 | c2 | c3  | c4         |
+----+----+----+-----+------------+
| 3  | 3  | 3  | 3.0 | 2000-01-03 |
| 2  | 2  | 2  | 2.0 | 2000-01-02 |
| 1  | 1  | 1  | 1.0 | 2000-01-01 |
+----+----+----+-----+------------+
```

t2
```
+----+----+----+------+------------+
| id | c1 | c2 | c3   | c4         |
+----+----+----+------+------------+
| 4  | 4  | 4  |  4.0 | 2000-01-04 |
| 2  | 20 | 20 | 20.0 | 2000-01-20 |
| 5  | 5  | 5  |  5.0 | 2000-01-05 |
| 1  | 10 | 10 | 10.0 | 2000-01-10 |
| 3  | 30 | 30 | 30.0 | 2000-01-30 |
+----+----+----+------+------------+
```

t3
```
+----+
| id |
+----+
| 1  |
| 5  |
| 4  |
+----+
```

Run the update:
```sql
UPDATE t1
  SET t1.c1 = t2.c1, t1.c3 = t2.c3 * 100
  FROM t2 INNER JOIN t3 ON t2.id = t3.id
  WHERE t1.id = t2.id;
```

The result:
```
+----+----+----+--------+------------+
| id | c1 | c2 | c3     | c4         |
+----+----+----+--------+------------+
| 3  | 3  | 3  |    3.0 | 2000-01-03 |
| 2  | 2  | 2  |    2.0 | 2000-01-02 |
| 1  | 10 | 1  | 1000.0 | 2000-01-01 |
+----+----+----+--------+------------+
```
2023-03-14 19:26:30 +08:00
36a0d40ac3 Fix errors in the data-partition.md (#17756) 2023-03-14 10:44:57 +08:00
76458cf091 [typo](partition)Modify the list partition document #17744 2023-03-14 08:27:26 +08:00
883ae8a86d [typo](docs) Add some content for bitmap_hash.md. (#17747) 2023-03-14 08:27:07 +08:00
c302fa2564 [Feature](array-function) Support array_pushfront function (#17584) 2023-03-13 14:26:02 +08:00
47cfc81925 [fix docs] (#17634)
Co-authored-by: shenshoucheng <shenshoucheng@jd.com>
2023-03-13 08:06:33 +08:00
33059d92cc [docs](doc) fix faq docs (#17707) 2023-03-13 08:05:12 +08:00
455c800405 [feature](parquet-reader) add rle bool and delta decoder to read AWS Glue (#17112)
Support delta encoding and RLE (bool) decoding to read Glue data:
add delta bit pack decoder,
add delta length byte array decoder,
add delta byte array decoder,
add rle bool decoder.

We found that some data types on AWS Glue are written with delta encoding, so they should be supported. For the definition of delta encoding, refer to the delta encoding section of the Parquet specification.
2023-03-12 20:09:58 +08:00
9b687026bd [Doc](TLS) add doc for TLS connection (#17683) 2023-03-12 10:01:07 +08:00
a74ef2377f typo fix in kyuubi doc (#17672) 2023-03-11 09:11:10 +08:00
48a2fe68ad [typo](docs) Fix some display errors (#17663)
* [fix](docs) fix some errors in docs
2023-03-11 09:10:48 +08:00
e1bf9411de [feature](array function) add support for array_enumerate_uniq (#17541)
add support for array_enumerate_uniq()
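A minimal usage sketch, assuming the usual semantics for this function (for each element, its occurrence index among equal elements):

```sql
-- The second 5 is the 2nd occurrence of 5, so the result is [1, 1, 2]
SELECT array_enumerate_uniq([5, 7, 5]);
```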
2023-03-10 10:20:49 +08:00
c7aa3f9717 [fix](backup) backup throw NPE when no partition in table (#17546)
If a table has no partitions, backup reports an error:

2023-03-06 17:35:32,971 ERROR (backupHandler|24) [Daemon.run():118] daemon thread got exception. name: backupHandler
java.util.NoSuchElementException: No value present
        at java.util.Optional.get(Optional.java:135) ~[?:1.8.0_152]
        at org.apache.doris.catalog.OlapTable.selectiveCopy(OlapTable.java:1259) ~[doris-fe.jar:1.0-SNAPSHOT]
        at org.apache.doris.backup.BackupJob.prepareBackupMeta(BackupJob.java:505) ~[doris-fe.jar:1.0-SNAPSHOT]
        at org.apache.doris.backup.BackupJob.prepareAndSendSnapshotTask(BackupJob.java:398) ~[doris-fe.jar:1.0-SNAPSHOT]
        at org.apache.doris.backup.BackupJob.run(BackupJob.java:301) ~[doris-fe.jar:1.0-SNAPSHOT]
        at org.apache.doris.backup.BackupHandler.runAfterCatalogReady(BackupHandler.java:188) ~[doris-fe.jar:1.0-SNAPSHOT]
        at org.apache.doris.common.util.MasterDaemon.runOneCycle(MasterDaemon.java:58) ~[doris-fe.jar:1.0-SNAPSHOT]
        at org.apache.doris.common.util.Daemon.run(Daemon.java:116) ~[doris-fe.jar:1.0-SNAPSHOT]
2023-03-10 10:19:37 +08:00
4ddd303cfc [Feature-wip](MySQL Load)Support cancel query for mysql load (#17233)
Notice some changes:
1. Support canceling queries for mysql load
2. Change the thread pool for the mysql load manager
3. Fix the secret path check logic
4. Fix some doc errors
2023-03-09 22:08:26 +08:00
0c48bb4d66 [typo](docs) Fix some misspelled words (#17605) 2023-03-09 21:55:58 +08:00
53bf1271ec [doc](multi-catalog) column type mapping for map&struct types (#17591) 2023-03-09 19:47:11 +08:00
49c54e59db [typo](docs) Fix some misspelled words (#17593) 2023-03-09 15:24:41 +08:00
2d027282f3 [fix](profile) modify load profile some bugs and docs (#17533)
1. The 'insert into' profile has the 'insert' type, so it cannot be queried by the 'load' type
2. The 'insert into' profile has no job_id, so it cannot be queried by job_id; therefore all profiles are now keyed by query_id
3. The 'broker load' profile is missing some info, causing an NPE
2023-03-09 11:58:40 +08:00
397cc011c4 [fix](function) fix AES/SM3/SM4 encrypt/decrypt algorithm initialization vector bug (#17420)
For the ECB algorithm, block_encryption_mode did not take effect; it only took effect when an init vector was provided.
Solved: 192/256 now support calculation without an init vector.

For other algorithms, an error should be reported when there is no init vector

Initialization Vector. The default value for the block_encryption_mode system variable is aes-128-ecb, or ECB mode, which does not require an initialization vector. The alternative permitted block encryption modes CBC, CFB1, CFB8, CFB128, and OFB all require an initialization vector.

Reference: https://dev.mysql.com/doc/refman/8.0/en/encryption-functions.html#function_aes-decrypt

Note: This fix does not support smooth upgrades. During the upgrade process, queries may report the error: function not found.
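A hedged illustration of the behavior described above, following the MySQL-style functions referenced in the linked docs (mode names, key, and IV values are illustrative assumptions):

```sql
-- ECB: no init vector required; block_encryption_mode selects key length/mode
SET block_encryption_mode = 'AES_256_ECB';
SELECT TO_BASE64(AES_ENCRYPT('text', 'F3229A0B371ED2D9441B830D21A390C3'));

-- CBC (likewise CFB/OFB): an init vector must be provided, otherwise an error is reported
SET block_encryption_mode = 'AES_256_CBC';
SELECT TO_BASE64(AES_ENCRYPT('text', 'F3229A0B371ED2D9441B830D21A390C3', '0123456789012345'));
```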
2023-03-09 09:51:41 +08:00
8a6a4b82aa [typo](docs) Add a hyperlink to facilitate user redirect. (#17563) 2023-03-09 09:47:10 +08:00
bd5ed2b0c2 [enhancement](histogram) optimize the histogram bucketing strategy, etc (#17264)
* optimize the histogram bucketing strategy, etc

* fix p0 regression of histogram
2023-03-08 20:12:05 +08:00
05b04e4c39 [BugFix](PG catalog) fix that pg catalog can not get all schemas that a pg user can access. (#17517)
In the past, the pg catalog used the SQL `SELECT schema_name FROM information_schema.schemata WHERE schema_owner='<UserName>';` to select a user's schemas. However, this SQL cannot find all the schemas a user can access, because:

A user may not be the owner of a schema but may still have read permission on it.
A user may inherit the permissions of its user group and thus have read permission on a schema.

For these reasons, we replace the SQL statement with `SELECT nspname FROM pg_namespace WHERE has_schema_privilege('<UserName>', nspname, 'USAGE');`.
2023-03-08 19:12:47 +08:00
4ea0d6c5fa [feature](array_function) add support for array_popfront (#17416) 2023-03-08 13:57:38 +08:00
b1d65f855d [Feature](array-function) Support array_concat function (#17436) 2023-03-08 13:57:16 +08:00
ae916f7cb3 [docs](doc) Add docs for Apache Kyuubi (#17481)
* add kyuubi doc of zh-CN & en
2023-03-08 09:36:50 +08:00
d8f0ca7108 [Chore](schema change) remove some unused code in schema change (#17459)
remove some unused code in schema change.
remove some row-based config and code.
2023-03-07 09:18:34 +08:00
50bf02024a [Improvement](meta) support returning total statistics of all databases for the command show proc '/jobs' (#17342)
Currently, the show proc jobs command can only be used on a specific database;
if a user wants to see overall data for the whole cluster, they have to look into every database and sum the numbers up,
which is troublesome.
Now they can achieve this simply by giving -1 as the dbId.

```
mysql> show proc '/jobs/-1';
+---------------+---------+---------+----------+-----------+-------+
| JobType       | Pending | Running | Finished | Cancelled | Total |
+---------------+---------+---------+----------+-----------+-------+
| load          |       0 |       0 |        0 |         2 |     2 |
| delete        |       0 |       0 |        0 |         0 |     0 |
| rollup        |       0 |       0 |        1 |         0 |     1 |
| schema_change |       0 |       0 |        2 |         0 |     2 |
| export        |       0 |       0 |        0 |         3 |     3 |
+---------------+---------+---------+----------+-----------+-------+

mysql> show proc '/jobs/-1/rollup';
+----------+--------------+---------------------+---------------------+---------------+-----------------+----------+---------------+----------+------+----------+---------+
| JobId    | TableName    | CreateTime          | FinishTime          | BaseIndexName | RollupIndexName | RollupId | TransactionId | State    | Msg  | Progress | Timeout |
+----------+--------------+---------------------+---------------------+---------------+-----------------+----------+---------------+----------+------+----------+---------+
| 17826065 | order_detail | 2023-02-23 04:21:01 | 2023-02-23 04:21:22 | order_detail  | rp1             | 17826066 | 6009          | FINISHED |      | NULL     | 2592000 |
+----------+--------------+---------------------+---------------------+---------------+-----------------+----------+---------------+----------+------+----------+---------+
1 row in set (0.01 sec)
```
2023-03-07 08:57:55 +08:00
bc48cbff83 [doc](auth)auth doc (#17358)
* auth doc

* auth en doc

* add note
2023-03-07 08:05:09 +08:00
78a1d630e4 [docs](typo) fix faq docs, already support rename column. (#17428)
* Update data-faq.md

Already support rename column.

* fix

---------

Co-authored-by: zhangyu209 <zhangyu209@meituan.com>
2023-03-07 08:03:51 +08:00
02015cf153 [docs](typo) Correct the wrong default value of DECIMAL type displayed in the Help CREATE TABLE #17422
Correct the wrong default value of DECIMAL type displayed in the Help CREATE TABLE
2023-03-06 12:50:30 +08:00
9617f46fa5 [improvement](memory) Modify mem_limit default value (#17322)
Modify the default value of mem_limit to auto. auto means process mem limit is equal to max(physical mem * 0.9, 6.4G).
6.4G is the maximum memory reserved for the system.
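A sketch of the corresponding BE setting, assuming it lives in be.conf as `mem_limit` (with the new default, the line can simply be omitted):

```
# be.conf -- hedged sketch
mem_limit = auto     # new default, resolved as described above
# mem_limit = 80%    # an explicit percentage still works
```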
2023-03-06 10:53:27 +08:00
d8a231f340 [Improvement](auth)(step-2) add ranger authorizer for hms catalog (#17424) 2023-03-05 21:50:44 +08:00
7b4fc412c5 [typo](docs) Optimize documents so that users can better understand. (#17295) 2023-03-04 21:02:45 +08:00
17164cf7a8 [fix](docs) add logic for batch delete when sequence column exists (#17367)
* [fix](docs) add logic for batch delete when sequence column exists.

Signed-off-by: nextdreamblue <zxw520blue1@163.com>

* add docs

Signed-off-by: nextdreamblue <zxw520blue1@163.com>

* fix docs 2

Signed-off-by: nextdreamblue <zxw520blue1@163.com>

---------

Signed-off-by: nextdreamblue <zxw520blue1@163.com>
2023-03-03 16:28:31 +08:00
3b94ca5ceb [chore](macOS) Use LLVM Clang by default (#17292)
Use LLVM Clang by default
2023-03-03 14:18:02 +08:00
6ce8200d9e [doc](typo) external-table-load.md (#17234)
* fix: external-table-load.md

The SQL had a syntax error.

* fix: external-table-load.md (Chinese)

The SQL had a syntax error.
2023-03-03 14:11:19 +08:00
11994b76d7 add the tag <version since="dev"> for insert_timeout. (#17316)
Co-authored-by: smallhibiscus <844981280>
2023-03-03 14:10:49 +08:00
ba108d40d8 [docs](link) Fix some broken links in docs (#17335)
* [docs](link) Fix some broken links in docs

* fix_typo
2023-03-03 14:08:05 +08:00
ba82cd10c6 [Enhancement](Jdbc catalog) Add two optional properties for jdbc catalog (#17245)
1. The first property is `only_specified_database`:
In the past, `Jdbc Catalog` synchronized all databases from the source database.
Now we add a parameter called `only_specified_database` to the jdbc catalog to allow only the specified database to be synchronized, e.g.:

```sql
create resource if not exists ${resource_name} properties(
    "type"="jdbc",
    "user"="root",
    "password"="123456",
    "jdbc_url" = "jdbc:mysql://172.18.0.1:${mysql_port}/doris_test?useSSL=false",
    "driver_url" = "https://doris-community-test-1308700295.cos.ap-hongkong.myqcloud.com/jdbc_driver/mysql-connector-java-8.0.25.jar",
    "driver_class" = "com.mysql.cj.jdbc.Driver",
    "only_specified_database" = "true"
);
```
If `only_specified_database` is `true`, the jdbc catalog will only synchronize the database specified in `jdbc_url`.

2. The second property is `lower_case_table_names`:
This property synchronizes table names from the jdbc external data source in lower case.

```sql
create resource if not exists ${resource_name} properties(
  "type"="jdbc",
  "user"="doris_test",
  "password"="123456",
  "jdbc_url" = "jdbc:oracle:thin:@172.18.0.1:${oracle_port}:${SID}",
  "driver_url" = "https://doris-community-test-1308700295.cos.ap-hongkong.myqcloud.com/jdbc_driver/ojdbc8.jar",
  "driver_class" = "oracle.jdbc.driver.OracleDriver",
  "lower_case_table_names" = "true"
);
```
2023-03-03 00:47:46 +08:00
39f59f554a [improvement](dry-run)(tvf) support csv schema in tvf and add "dry_run_query" variable (#16983)
This CL mainly changes:

Support specifying a csv schema manually in the s3/hdfs table valued function:

```sql
s3 (
    'URI' = 'https://bucket1/inventory.dat',
    'ACCESS_KEY'= 'ak',
    'SECRET_KEY' = 'sk',
    'FORMAT' = 'csv',
    'column_separator' = '|',
    'csv_schema' = 'k1:int;k2:int;k3:int;k4:decimal(38,10)',
    'use_path_style'='true'
)
```
Add a new session variable dry_run_query:

If set to true, the real query result will not be returned; instead, only the number of returned rows is returned.

```
mysql> select * from bigtable;
+--------------+
| ReturnedRows |
+--------------+
| 10000000     |
+--------------+
```
This avoids the transmission time of large result sets and focuses on the real execution time of the query engine, which is useful for debugging and analysis.
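A quick usage sketch of the variable named above (the table name is illustrative):

```sql
-- Only the row count comes back, not the data itself
SET dry_run_query = true;
SELECT * FROM bigtable;   -- returns a single ReturnedRows value
SET dry_run_query = false;
```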
2023-03-02 16:51:27 +08:00
9f088f6e90 [feature](json) add json_valid function (#17247)
add json_valid function
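A minimal sketch, assuming MySQL-compatible JSON_VALID semantics (1 for valid JSON, 0 for invalid, NULL for NULL input):

```sql
SELECT json_valid('{"k1": "v1"}'); -- 1
SELECT json_valid('invalid json'); -- 0
SELECT json_valid(NULL);           -- NULL
```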

Signed-off-by: nextdreamblue <zxw520blue1@163.com>
2023-03-02 14:08:52 +08:00
30df268c1f [fix](hdfs)(catalog) fix BE crash when hdfs-site.xml not exist in be/conf and fix compute node logic (#17244)
We set the LIBHDFS3_CONF env in start_be.sh, so libhdfs3 will try to read this hdfs-site.xml;
if the file does not exist, it throws an error. But Doris did not handle this error, causing BE to crash.
This CL mainly changes:

Modify start_be.sh to only set LIBHDFS3_CONF if hdfs-site.xml exists.
Refactor the HDFSCommonBuilder so that it can return errors correctly.
Add BE IP info in status, so that we can get the IP from an error msg like:
ERROR 1105 (HY000): errCode = 2, detailMessage = [INTERNAL_ERROR]failed to init reader for file  000.snappy.orc, err: 
[INTERNAL_ERROR][172.21.0.101]failed to init HDFSCommonBuilder, please check check be/conf/hdfs-site.xml
The logic of preferring compute nodes was wrong, causing external table queries to be assigned to at most 3 backends.
This CL refactors this logic and also changes some FE configs:

prefer_compute_node_for_external_table

If set to true, queries on external tables prefer to be assigned to compute nodes,
and the max number of compute nodes is controlled by min_backend_num_for_external_table.
If set to false, queries on external tables may be assigned to any node.

min_backend_num_for_external_table

Only takes effect when prefer_compute_node_for_external_table is true.
If the number of compute nodes is less than this value, queries on external tables will try to get some mix nodes
to assign, so that the total number of nodes reaches this value.
If the number of compute nodes is larger than this value, queries on external tables are assigned to compute nodes only.
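A hedged fe.conf sketch wiring the two options together (the option names come from the text above; the values are illustrative):

```
# fe.conf -- hedged sketch
prefer_compute_node_for_external_table = true
# fill with mix nodes until at least this many nodes serve external table queries
min_backend_num_for_external_table = 3
```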
2023-03-02 11:09:55 +08:00
b0c5250bf9 [Enhancement](tvf) support trim_double_quotes and skip_lines for S3 and HDFS table valued function (#17224)
support trim_double_quotes and skip_lines for S3 and HDFS table valued function
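A hedged example with the s3 table valued function (URI and credentials are placeholders; the two parameter names come from the title):

```sql
-- Strip wrapping double quotes from CSV fields and skip the first 2 lines
SELECT * FROM s3(
    'URI' = 'https://bucket/path/data.csv',
    'ACCESS_KEY' = 'ak',
    'SECRET_KEY' = 'sk',
    'FORMAT' = 'csv',
    'trim_double_quotes' = 'true',
    'skip_lines' = '2'
);
```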
2023-03-01 23:41:31 +08:00