Commit Graph

1618 Commits

Author SHA1 Message Date
cf2de85633 [Document](function) add docs of hll_cardinality/hll_empty/hll_hash (#14547) 2022-11-27 13:40:16 +08:00
6b8f1a1a88 [typo](docs) fix four string functions' document issue #14610
Co-authored-by: yuleiyang <yuleiyang@tencent.com>
2022-11-26 19:27:40 +08:00
7486660c12 [feature](Docs) add backends and rowsets docs (#14542)
* add backends and rowsets docs

* fix doc
2022-11-26 12:24:50 +08:00
b9fffe1633 [typo](docs)Modify the array function doc (#14592) 2022-11-25 22:31:05 +08:00
4728e75079 [feature](bitmap) Support in bitmap syntax and bitmap runtime filter (#14340)
1.Support in bitmap syntax, like 'where k1 in (select bitmap_column from tbl)';
2.Support bitmap runtime filter. Generate a bitmap filter using the right table bitmap and push it down to the left table storage layer for filtering.
2022-11-25 15:22:44 +08:00
7ba4cd764a [enhancement](array-function) array_position,array_contains,countequal which in FunctionArrayIndex handle target NULL (#14564)
in the previous, the result is:
```
mysql> select array_position([1, null], null);
+--------------------------------------+
| array_position(ARRAY(1, NULL), NULL) |
+--------------------------------------+
|                                 NULL |
+--------------------------------------+
1 row in set (0.02 sec)
```

but after this commit, the result become:
```
mysql> select array_position([1, null], null);
+--------------------------------------+
| array_position(ARRAY(1, NULL), NULL) |
+--------------------------------------+
|                                    2 |
+--------------------------------------+
1 row in set (0.02 sec)
```
2022-11-25 14:19:50 +08:00
d5d356b17f [vectorized](function) support order by field function (#14528)
* [vectorized](function) support order by field function

* update

* update test
2022-11-25 14:00:46 +08:00
5ccc875824 [fix](recycle) refactor the logic of erase meta with same name (#14551)
in #14482, we implement the feature to keep specific number of meta with same name in catalog recycle bin.
But it will cause meta replay bug.
Because every time we drop db/table/partition, it will try to erase a certain number of meta with same name.
And when replay "drop" edit log, it will do same thing. But the number of meta to erase it based on current config value,
not persist in edit log, so it will cause inconsistency with "drop" and "replay drop".

In this PR, I move the "erase meta with same name" logic to the daemon thread of catalog recycle bin.
2022-11-25 09:47:24 +08:00
bc699511d0 [Fix](array-function) fix array_distinct null values (#14544)
in the previous the result is:
```
mysql> select array_distinct([1,1,3,3,null, null, null]);
+-----------------------------------------------------+
| array_distinct(ARRAY(1, 1, 3, 3, NULL, NULL, NULL)) |
+-----------------------------------------------------+
| [1, 3, NULL, NULL, NULL]                            |
+-----------------------------------------------------+
1 row in set (0.00 sec)
```

after this fix, the result becomes:
```
mysql> select array_distinct([1,1,3,3,null, null, null]);
+-----------------------------------------------------+
| array_distinct(ARRAY(1, 1, 3, 3, NULL, NULL, NULL)) |
+-----------------------------------------------------+
| [1, 3, NULL]                                        |
+-----------------------------------------------------+
1 row in set (0.00 sec)
```
2022-11-24 19:07:28 +08:00
59b31a03c4 [Improvement](agg function) support group_bit_and/group_bit_or/group_bit_xor functions (#14386) 2022-11-24 16:46:42 +08:00
9e39a04b63 [Doc](flink connector) add flink connector faq (#14520) 2022-11-23 21:40:00 +08:00
181f1cf176 [Docs](function) add some missing function docs (#14510) 2022-11-23 21:39:17 +08:00
648fd93dc5 [DOCS](function) add document for grouping and grouping_id (#14472) 2022-11-23 18:07:48 +08:00
09cc385caa [Docs](fucntion) Add docs for function random, mod, fmod (#14444) 2022-11-23 16:22:57 +08:00
Pxl
672d3b8334 [Document](function) update document of to_base64/from_base64 #14487 2022-11-23 14:04:53 +08:00
79688c34a1 [feature](catalog) add max num of same name meta information in catalog recycle bin (#14482) 2022-11-23 14:04:14 +08:00
wxy
9d165c6bba [docs](fe.conf) add configuration for decommission_tablet_check_threshold. (#14355) 2022-11-23 12:58:52 +08:00
9ec66f939a [fix](docs) fix the docs of command "show proc '/statistic'" and ADMIN CHECK TABLET (#14282)
Co-authored-by: smallhibiscus <844981280>
2022-11-22 21:53:11 +08:00
6124d0e0b7 [fix](doc) add doc description with multi partition (#14262) 2022-11-22 21:51:55 +08:00
2eca51f3ba [enhancement](broker) broker load support tencent cos (#12801) 2022-11-22 21:51:15 +08:00
6eeebd47a9 [improvement](doc) add missing documents (#14460) 2022-11-22 21:42:00 +08:00
e77151868d [Docs](table-valued-function) add docs for s3 and hdfs tvf (#14369) 2022-11-22 21:31:47 +08:00
b04ec41c1d [Vectorized](udaf) fix java-udaf couldn't get jar core dump (#14393)
fix java-udaf couldn't get jar core dump
2022-11-22 20:49:02 +08:00
d8b03db45a [typo](docs) add-kafka-kerberos-version #14489 2022-11-22 18:45:52 +08:00
b9f017ebb1 [typo](docs) kafka kerberos #14479 2022-11-22 14:42:16 +08:00
74f694753b Fix the en docs of benchmark (#14459) 2022-11-22 08:40:51 +08:00
b36f3d7e61 [typo](docs) fix typo in schema-change.md (#14311) 2022-11-21 13:38:47 +08:00
ce489cf723 [Feature](JDBC)support clickhouse jdbc external table (#14244) 2022-11-21 10:33:53 +08:00
98cea90950 [typo](docs)benchmark doc fix number (#14427) 2022-11-20 22:51:42 +08:00
c29975d347 [Docs](function) Add some function do not in sidebars (#14426) 2022-11-20 22:50:52 +08:00
71e80e8957 [typo](docs)Performance test documentation update (#14147)
* Performance test documentation update
2022-11-20 09:40:57 +08:00
2ccb5209a0 (improvement)[doc] add document version tag instruction (#14406) 2022-11-20 00:05:53 +08:00
f5f2e84e31 [refactor](planner) remove the limit return rows of order by (#12478)
Originally, Order By Limit returned a maximum of 65535 rows of data by default during the query,
but now many businesses do not apply this limit.
It is necessary to add larger data after the query statement to complete the full data query,
which is extremely inconvenient, so adjustments have been made.

At the same time, I added the variable DEFAULT_ORDER_BY_LIMIT to the SessionVariable,
the default value is -1, if the user does not use the LIMIT keyword or the LIMIT value is a negative integer,
the default query return value is Long.MAX_VALUE. If the corresponding maximum query value is set,
the number of data items is returned according to the maximum query value or the value followed by the
LIMIT keyword.
2022-11-19 12:45:44 +08:00
b4aef889f2 [feature-array](array-function) add array constructor function array() (#14250)
* [feature-array](array-function) add array constructor function `array()`

```
mysql>  select array(qid, creationDate) from nested_c_2  limit 10;
+------------------------------+
| array(`qid`, `creationDate`) |
+------------------------------+
| [1000038, 20090616074056]    |
| [1000069, 20090616075005]    |
| [1000130, 20090616080918]    |
| [1000145, 20090616081545]    |
+------------------------------+
10 rows in set (0.01 sec)
```
2022-11-19 10:49:50 +08:00
2c4236fd24 [improvement](ctas) use string type for varchar/char/string (#14382)
When executing create table as select stmt,
the varchar/char/string type of column in created table will be unified to string type.

Because when select from external table (mysql/pg, etc), the length of varchar in external database
is calculated by "char" length, not "byte" length.
So if there is a column with varchar(10) in external table, then there will be a same varchar(10)
in created table. But the byte length of data in external table may be larger than 10, causing failure of CTAS.

Change to string will not impact performance of the capacity of disk storage.
And notice that if a string type column is the first column, it will be changed to varchar(65535),
because we do not allow string type column as sort key column.
2022-11-18 14:20:13 +08:00
fb140d0180 [Enhancement](sequence-column) optimize the use of sequence column (#13872)
When you create the Uniq table, you can specify the mapping of sequence column to other columns.
You no longer need to specify mapping column when importing.
2022-11-17 22:39:09 +08:00
8fe5211df4 [improvement](multi-catalog)(cache) invalidate catalog cache when refresh (#14342)
Invalidate catalog/db/table cache when doing
refresh catalog/db/table.

Tested table with 10000 partitions. The refresh operation will cost about 10-20 ms.
2022-11-17 20:47:46 +08:00
a4d4fc8c02 datax doris writer doc fix (#14344) 2022-11-17 13:08:32 +08:00
0bf6d1fd79 [typo](doc)Datax doris writer doc update (#14328) 2022-11-17 08:53:55 +08:00
3259fcb790 [typo](docs) fix docs kafka-load.md (#14313) 2022-11-16 23:17:30 +08:00
70cc725649 [Vectorized](function) support avg_weighted/percentile_array/topn_wei… (#14209)
* [Vectorized](function) support avg_weighted/percentile_array/topn_weighted functions

* update add to stringRef
2022-11-15 16:38:38 +08:00
f86886f8f5 [Feature](function) Support array_compact function (#14141) 2022-11-15 14:24:37 +08:00
93e5d8e660 [Vectorized](function) support bitmap_from_array function (#14259) 2022-11-15 01:55:51 +08:00
7eed5a292c [feature-wip](multi-catalog) Support hive partition cache (#14134) 2022-11-14 14:12:40 +08:00
23a8c7eeb6 (fix)(multi-catalog)(es) Fix error result because not used fields_context (#14229)
Fix error result because not used fields_context
2022-11-14 14:00:55 +08:00
43490a33a5 [feature-array](array-type) Add array function array_with_constant (#14115)
Return array of constants with length num.

```
mysql> select array_with_constant(4, 1223);
+------------------------------+
| array_with_constant(4, 1223) |
+------------------------------+
| [1223, 1223, 1223, 1223]     |
+------------------------------+
1 row in set (0.01 sec)
```
co-authored-by @eldenmoon
2022-11-11 22:08:43 +08:00
0ba13af8ff [feature](running_difference) support running_difference function (#13737) 2022-11-11 21:22:56 +08:00
a162dab40a [feature](docs) add docs for SHOW-CATALOG-RECYCLE-BIN (#14185) 2022-11-11 15:54:05 +08:00
b6ba654f5b [Feature](Sequence) Support sequence_match and sequence_count functions (#13785) 2022-11-11 13:38:45 +08:00
7782fb63ca [docs](outfile) Add ORC to outfile document (#14153) 2022-11-11 09:42:30 +08:00