Commit Graph

8276 Commits

Author SHA1 Message Date
092a394782 [improvement](agg)limit the output of agg node (#11461)
* [improvement](agg)limit the output of agg node
2022-08-05 07:53:55 +08:00
75fc830573 [Bug](date function) fix wrong year for format '%x' (#11520) 2022-08-05 06:22:22 +08:00
b6118acc19 [feature](functions) support add_months on vectorized engine (#11518) 2022-08-04 21:39:10 +08:00
d4e6e3edfd [bugfix]fix time accuracy (#11521) 2022-08-04 21:36:20 +08:00
e11024f5cc [enhancement](Nereids)set default join type to CROSS_JOIN (#11459)
set default join type to CROSS_JOIN on join that has no equal on condition when parse sql string.
2022-08-04 21:25:24 +08:00
6dc41d57f3 [enhancement](Nereids)support count, min and avg function (#11374)
1. add count function
2. add min function
3. add avg function
2022-08-04 21:19:32 +08:00
346fdeeee0 [fix](ut) Fix BE UT BetaRowsetTest failed (#11500) 2022-08-04 17:57:57 +08:00
aed0282046 [feature-wip](parquet-reader)get compressed parquet page data (#11493) 2022-08-04 17:44:52 +08:00
591b7f3f92 [multi-catalog](oss)Support hive external table on Ali oss. (#11489) 2022-08-04 17:44:34 +08:00
ef65fbb4f0 [tools](ssb-flat)update script for better performance (#11505) 2022-08-04 17:43:26 +08:00
165f0b5ac3 [fix](auditloader)add execute permission for fe_plugins/auditloader/build.sh (#11507)
Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>
2022-08-04 17:42:31 +08:00
95091256b0 [chore](deps) update bdbje tp doris bdbje, update libhdfs3 to improve performance (#11497) 2022-08-04 17:10:56 +08:00
c176ff54a5 [improvement](orc)improve hdfs scan performance (#11501)
Co-authored-by: root <root@bytedance>
2022-08-04 17:10:30 +08:00
9f221a703b [feature-wip](statistics) step5: show statistics job information (#8862)
This pull request includes some implementations of the statistics(https://github.com/apache/incubator-doris/issues/6370), it will not affect any existing code and users will not be able to create statistics job.

It implements the display of statistics job information, with the following syntax, users will be able to view the corresponding job information.

syntax:
```
SHOW ANALYZE
    [TABLE | ID]
    [
        WHERE
        [STATE = ["PENDING"|"SCHEDULING"|"RUNNING"|"FINISHED"|"FAILED"|"CANCELLED"]]
    ]
    [ORDER BY ...]
    [LIMIT limit][OFFSET offset];
```

e.g.
| id    | create_time             | start_time              | finish_time             | error_msg | scope               | progress | state    |
| ----- | ----------------------- | ----------------------- | ----------------------- | --------- | ------------------- | -------- | -------- |
| 60051 | 2022-07-21 01:26:26.173 | 2022-07-21 01:26:26.186 | 2022-07-21 01:26:27.104 |           | table1(citycode,pv) | 5/5      | FINISHED |
2022-08-04 16:10:49 +08:00
397bf354db [WIP](optional) using hash set to distinct single value (#11246)
* [WIP](optional) using hash set to distinct single value


Co-authored-by: wangbo36@meituan.com <wangbo36@meituan.com>
2022-08-04 15:52:58 +08:00
9078ab4d24 [feature](FE): add new property to control whether use light schema change or not (#11169) 2022-08-04 15:49:05 +08:00
Pxl
ec3c911f97 [Feature][Materialized-View] support materialized view on vectorized engine (#10792) 2022-08-04 14:07:48 +08:00
35f2632c3a [doc]fix-doc (#11439)
Update get-starting.md
2022-08-04 14:00:43 +08:00
277e5e7bd1 [fix](doc)Document correction, delete some old document content, add some explanatory information (#11504)
Document correction, delete some old document content, add some explanatory information
2022-08-04 14:00:03 +08:00
9ed36aa48b [doc](website)fe elastic scaling documentation issue (#11471)
fe elastic scaling documentation issue
2022-08-04 13:57:59 +08:00
8a09f0478f [regressiontest](improvement)SQL query conditions, if the data type is expanded with single quotes, an error will occur case (#11437) 2022-08-04 11:28:12 +08:00
e7f378fec6 [Enhancement](IdGenerator) Use IdGeneratorBuffer to get better performance for getNextId operation when create table, truncate table, add partition and so on (#11479)
Co-authored-by: caiconghui1 <caiconghui1@jd.com>
2022-08-04 11:21:35 +08:00
36784d9131 [bugfix](compile) Fix compile hyperscan failed when build thirdparty #11481 2022-08-04 11:21:17 +08:00
7703912b3e [improvement](error msg)improve the err msg returned when the key not in columns when create table (#11490) 2022-08-04 11:20:49 +08:00
a943adac1a [feature](cache) Add FileCache for RemoteFile (#11186)
Add FileCache for RemoteFile, it will be opened in StoragePolicy.
Cold data in remote file will be download to local cache files.
2022-08-04 10:57:32 +08:00
ecbf87d77b [bugfix](memtracker)fix exceed memory limit log (#11485) 2022-08-04 10:22:20 +08:00
Pxl
ce68d24e95 [Bug](function) fix current_date not equal to curdate (#11463)
* fix current_date not equal to curdate
2022-08-04 09:25:50 +08:00
33053ad1fe [improvement](outfile) support multibyte separator in outfile clause (#11487) 2022-08-04 09:06:06 +08:00
838fdc1354 [Bug](httpserver) Fix bug that http server should not be stoped in destructor if it not running
Co-authored-by: weizuo <weizuo@xiaomi.com>
2022-08-03 19:44:46 +08:00
4ba2422039 [improvement](fe) Remove constant keys in aggregation (#11434) 2022-08-03 19:43:35 +08:00
7366f187d7 [doc](schema change) Correct Schema Change error description (#11456)
Correct Schema Change error description
2022-08-03 18:35:55 +08:00
e1b878fe10 [Improvement](datev2) apply time LUT to datev2/datetimev2 (#11401)
* [Improvement](datev2) apply time LUT to datev2/datetimev2
2022-08-03 17:15:09 +08:00
a47eff1e46 [enhancement](Nereids) support all join type in Nereids that could do join by HashJoinNode (#11446)
add and test join type:
1. inner join
2. left outer join
3. right outer join
4. left semi join
5. right semi join
6. left anti join
7. right anti join
2022-08-03 12:14:17 +08:00
30e7f76def [regression](unique-key-merge)add tpch_sf1_p1 test (#11442) 2022-08-03 11:22:50 +08:00
4864a83f04 [regression](unique-key-merge-test) add ssb_sf1_p1 test (#11447) 2022-08-03 11:22:24 +08:00
5b9b6c9065 [WIP](decimalv3) WIP (#11443)
* [feature-WIP](decimalv3) fix some bugs of decimalv3
2022-08-03 11:21:36 +08:00
77d82bb292 [Bug](MaterializedView) Fix bug of light schema change do not set right unique id cause MV coredump (#11396)
Fix bug of light schema change do not set right unique id cause MV coredump
2022-08-03 11:21:28 +08:00
b892dfdbbd [Improvement](regresstion test) Fix regression test case failure for ASAN build (#11400)
* [improvement](regresstion test) Improve performance of ASAN build by using -O3 and fix mem limit exceed error for nereids test cases

* exclude tpcds_sf1 q72 for ASAN build because this query takes too long time
2022-08-03 11:19:50 +08:00
573ebf235e [enhancement](build) Support customizing extra compile flags (#11444) 2022-08-03 11:02:17 +08:00
4ae4909a2b [refactor](tvf) table-valued-function table (#11452) 2022-08-03 10:39:16 +08:00
1b4d6a620a (feature-wip)[parquet-reader] support page index serde (#11415) 2022-08-03 10:36:06 +08:00
c581855a41 [fix](hive-table) fix bug that hive external table can not query table created by Tez (#11345)
* [fix](hive-table) fix bug that hive external table can not query table created by Tez

If the hive is created by Tez, the location of the table is a second-level director, eg:

/user/hive/warehouse/region_tmp_union_all/
---/user/hive/warehouse/region_tmp_union_all/1
---/user/hive/warehouse/region_tmp_union_all/2

We should recursive traverse the directory to get the real files.
2022-08-03 09:07:47 +08:00
db3ba02993 [fix](planner) Fix an issue where outputSmap's size could grow exponentially (#11378)
* [fix](planner) Fix an issue where outputSmap'size  could grow exponentially
2022-08-03 09:07:00 +08:00
71ea3f2906 fix(typo): in docs(zh-CN) how to be a committer (#11445) 2022-08-03 08:45:22 +08:00
1e59c4054a [minor](log) add some log to observe the change of table's state. (#11448)
When upgrading to 1.x, some table's state may change to ROLLUP,
Resulting in not able to create/drop/modify partition.

I haven't find the root cause yet, so I add some log to observe
the change of table's state.
2022-08-03 08:43:14 +08:00
de4466624d [refactor](schema change)Remove delete from sc (#11441)
* not need call delete handler to filter rows since they are filtered in rowset reader

* need not call delete eval in schema change and remove related code

Co-authored-by: yiguolei <yiguolei@gmail.com>
2022-08-03 03:29:41 +08:00
7148e3dfaa [Community](readme)Update README.md (#11388)
add technical overview and fix some typo.
2022-08-02 22:27:22 +08:00
4850873f66 [doc](sql-block-rule) add more example (#11416)
sql-block-rule : add more example
2022-08-02 18:39:13 +08:00
6e684c695d fix-doc (#11436)
fe-config fix
2022-08-02 18:38:30 +08:00
d6149e4777 [Enhancement](TabletInvertIndex) use StampLock to instead of ReentranReadWriteLock to get better performance (#11422)
Co-authored-by: caiconghui1 <caiconghui1@jd.com>
2022-08-02 17:53:30 +08:00