Commit Graph

4317 Commits

Author SHA1 Message Date
Pxl
64cf64d1f8 remove unused code and opt int_div (#8966) 2022-04-13 09:51:01 +08:00
7e08d3e320 Modify the maximum and minimum number of threads in jetty (#8960)
Co-authored-by: smallhibiscus <844981280>
2022-04-13 09:50:46 +08:00
d79e8a7b5a [fix](load) start transaction before we need it (#8819) (#8908) 2022-04-13 09:50:26 +08:00
4462da742f [Refactor][Doc] add show help doc (#8912) 2022-04-13 09:49:21 +08:00
d11698d2de [Refactor][doc] add data model and index doc (#8916) 2022-04-13 09:49:02 +08:00
5f9b241999 [Refactor][doc] Modify the documentation of alter, backup and restore in DDL (#8957) 2022-04-13 09:48:39 +08:00
767000b3f9 [Refactor][doc] add doc for auth management and conf (#8951) 2022-04-13 09:48:17 +08:00
52d18aa83c permute impl for column array; and codes format (#8949)
Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>
2022-04-13 09:47:54 +08:00
59abd65bfc add dml load sql help (#8939)
add dml load sql help
2022-04-13 09:47:30 +08:00
3b8ca4b035 add cluster admin help (#8924)
add cluster admin help
2022-04-13 09:47:12 +08:00
b33ab960a8 [fix] move new add enum OFS of StorageType to last (#8983)
* [fix] move new add enum OFS of StorageType to last

* modify enum in gensrc/thrift/Types.thrift
2022-04-12 20:21:15 +08:00
0c8ea8ce9f [Vectorizd] Let VAssertRowNumNode handle return value of child->get_next (#8969) 2022-04-12 19:56:03 +08:00
6af1c52e13 [Feature] add support for tencent chdfs (#8963)
Co-authored-by: chengwu <chengwu@tencent.com>
2022-04-12 16:02:42 +08:00
51269efbb7 [improvement]Disable mini load (#8955)
Disable miniload by default
2022-04-12 16:01:03 +08:00
5a44eeaf62 [refactor] Unify all unit tests into one binary file (#8958)
1. solved the previous delayed unit test file size is too large (1.7G+) and the unit test link time is too long problem problems
2. Unify all unit tests into one file to significantly reduce unit test execution time to less than 3 mins
3. temporarily disable stream_load_test.cpp, metrics_action_test.cpp, load_channel_mgr_test.cpp because it will re-implement part of the code and affect other tests
2022-04-12 15:30:40 +08:00
66d2f4e1fd [fix][mem tracker] Fix MemTracker null pointer in vectorized (#8925)
Fix ThreadMemTrackerMgr::update_tracker null pointer and some details.

Issue Number: close #8920
2022-04-12 10:17:10 +08:00
f992247779 [docs] update alter table doc (#8915) 2022-04-12 09:26:40 +08:00
7494a4cbdf DML help doc (#8953)
DML help doc
2022-04-12 08:45:16 +08:00
81495b0ff1 [docs]correct spelling of the word 'LABEL' #3265 (#8956) 2022-04-12 08:44:33 +08:00
667eac9b7d Utility-Statements SQL Help (#8952)
Utility-Statements SQL Help
2022-04-12 08:44:16 +08:00
067309c466 [fix](compile) fix compilation bug (#8950) 2022-04-11 13:12:34 +08:00
0b1b3e225d Revert "[Thirdparty]Add llvm for codegen (#8938)" (#8948)
This reverts commit 32133621c69a2d7544549c5ea54ed6d9de60415e.

Reverts #8938
The LLVM requires GLIBC_2.15. I decided to create a branch for the llvm feature first.
And once we resolve the low version glibc issue, it will be merged back to master.
2022-04-11 13:03:16 +08:00
Pxl
8a066e2586 [fix](vectorized) core dump on ST_AsText (#8870) 2022-04-11 09:39:32 +08:00
8158b05ea0 [fix] Fix bug that tablet data size and row num info are failed to report. (#8945)
Introduced from #8146
2022-04-11 09:38:28 +08:00
fd054ca2f6 [doc](java-udf) add docs for Java UDF (#8944) 2022-04-11 09:37:48 +08:00
2abb9c1bca [doc](readme) Add Spark / Flink Connector (#8943)
Add Spark / Flink Connector
2022-04-11 09:37:14 +08:00
7f7172807f [feature](function)(vectorized) Support all geolocation functions on vectorized engine (#8846) 2022-04-11 09:36:53 +08:00
0d761f9909 [feature-wip][UDF][DIP-1] Support variable-size input and output for Java UDF (#8678)
This feature is proposed in DSIP-1. This PR support variable-length input and output Java UDF.
2022-04-11 09:36:16 +08:00
174e22b9f0 [feature](github-action) add scope labeler (#8935) 2022-04-10 23:06:03 +08:00
936b942e3a [fix](error-code) replace invalid format specifier (#8940)
change %lu and %ld to %d
2022-04-10 20:37:32 +08:00
32133621c6 [Thirdparty]Add llvm for codegen (#8938) 2022-04-10 20:37:09 +08:00
6ed59bb98b [refactor](code_style) remove useless inline #8933
1.Member functions defined in a class are inline by default (implicitly), and do not need to be added
2.inline is a keyword used for implementation, which has no effect when placed before the function declaration
2022-04-10 18:29:55 +08:00
1fe4ea4c7c [Refactor-step1] Add OLAPInternalError to status (#8900) 2022-04-10 00:16:43 +08:00
71aedb994e [refactor][doc] add data-table doc (#8927) 2022-04-09 19:18:44 +08:00
1ee8633e5e [fix](account) use LOG.info instead of LOG.debug (#8911)
This complements (#8849)
2022-04-09 19:18:13 +08:00
5706679e08 [fix] fix the problem that using tsan to compile,BE will stack overflow when start (#8904)
Currently TSAN can only be compiled using CLang, not GCC.
And when compiling with -o0, stack overflow occurs at startup, issue #8868.
A function definition will be reported missing at compile time, the file provided in PR #8665 is required.
2022-04-09 19:17:28 +08:00
f28ad36c02 [test][improvement] support execute multiple sql in sql file (#8902)
regression testing framework support execute multiple sql in sql file
2022-04-09 19:15:53 +08:00
ce6b5169c2 [fix](join) Fix error bucket num get in bucket shuffle join in dynamic partition (#8891) 2022-04-09 19:11:44 +08:00
2c1c7f40b6 [refactor][doc] Add data backup, data restore and data delete recovery (#8865)
1.Add data backup doc,
2.add data restore doc,
3.add data delete recovery doc
2022-04-09 19:04:57 +08:00
1de0ea2dc4 [refactor][doc] Added documentation for advanced usage section (#8826)
1.Materialized view
2.Schema Change
3.Dynamic Partition
4.Bucket Shuffle Join
5.Colocation Join
6.Runtime Filter
7.partition cache
8.Orthogonal BITMAP calculation
9.Variable
10.Time zone
11.File Manager
2022-04-09 19:03:43 +08:00
a290104966 [fix](routine load) Routine load task doesn't reallocate when previous BE is down. (#8824)
if previous be is not alive, should assigned another available BE instead.
2022-04-09 19:02:55 +08:00
0f10f84075 [refactor][doc] Add update-delete documentation (#8821) 2022-04-09 19:02:16 +08:00
ddf7ef9327 [improvement](join) update broadcast join cost algorithm (#8695)
broadcast join cost is used compressed data size currently.
The amount of memory used may be significantly more than estimated.
This patch:
1. add a compressed ratio to broadcast join cost and set to 5 according to the experience.
2. add a new session variable `auto_broadcast_join_threshold` to limit memory used by broadcast in bytes, the default value is 1073741824(1GB)
2022-04-09 19:00:27 +08:00
2059e88d43 [fix][doc] remove non-exist outfile.md (#8913) 2022-04-08 23:24:10 +08:00
Pxl
453485abfb [Bug] Fix some bugs(rewrite rule/symbol transport) of like predicate (#8770) 2022-04-08 14:32:09 +08:00
c5718928df [feature-wip](array-type) support explode and explode_outer table function (#8766)
explode(ArrayColumn) desc:
> Create a row for each element in the array column. 

explode_outer(ArrayColumn) desc:
> Create a row for each element in the array column. Unlike explode, if the array is null or empty, it returns null.

Usage example:
1. create a table with array column, and insert some data;
2. open enable_lateral_view and enable_vectorized_engine;
```
set enable_lateral_view = true;
set enable_vectorized_engine=true;
```
3. use explode_outer
```
> select * from array_test;
+------+------+--------+
| k1   | k2   | k3     |
+------+------+--------+
|    3 | NULL | NULL   |
|    1 |    2 | [1, 2] |
|    2 |    3 | NULL   |
|    4 | NULL | []     |
+------+------+--------+

> select k1,explode_column from array_test LATERAL VIEW explode_outer(k3) TempExplodeView as explode_column;
+------+----------------+
| k1   | explode_column |
+------+----------------+
|    1 |              1 |
|    1 |              2 |
|    2 |           NULL |
|    4 |           NULL |
|    3 |           NULL |
+------+----------------+
```
4. explode usage example. explode return empty rows while the ARRAY is null or empty
```
> select k1,explode_column from array_test LATERAL VIEW explode(k3) TempExplodeView as explode_column;
+------+----------------+
| k1   | explode_column |
+------+----------------+
|    1 |              1 |
|    1 |              2 |
+------+----------------+
```
2022-04-08 12:11:04 +08:00
bd0a3369b7 [fix] check disk capacity before writing data (#8887)
1. We forgot to check disk capacity when writing data.
2. TODO: the user specified disk capacity is not used now. We need to find a way to use it.
3. Avoid print too much compaction log when there is not suitable version for compaction.
2022-04-08 11:29:49 +08:00
f854f0e83e remove unreadable char in comment (#8909) 2022-04-08 09:26:53 +08:00
3dd6b42781 [fix](datax) Fix the problem of keyword error when importing datax (#8893) 2022-04-08 09:20:54 +08:00
Pxl
dbbc6549bd [feature](vectorized) support vexplode_bitmap (#8890) 2022-04-08 09:20:26 +08:00