Commit Graph

6608 Commits

Author SHA1 Message Date
25a6be850d [doc](fix)Export doc fix (#11584)
export doc fix
2022-08-08 19:27:57 +08:00
ed7f7dead9 [Refactor](push-down predicate) Derive push-down predicate from vconjuncts (#11468)
* [Refactor](push-down predicate) Derive push-down predicate from vconjuncts
2022-08-08 19:19:26 +08:00
0a5fd99d02 [feature-wip](unique-key-merge-on-write) speed up publish_txn (#11557)
In our origin design, we calc delete bitmap in publish txn, and this operation
will cost too much time as it will load segment data and lookup row key in pre
rowset and segments.And publish version task should run in order, so it'll lead
to timeout in publish_txn.

In this pr, we seperate delete_bitmap calculation to tow part, one of it will be
done in flush mem table, so this work can run parallel. And we calc final
delete_bitmap in publish_txn, get a rowset_id set that should be included and
remove rowsets that has been compacted, the rowset difference between memtable_flush
and publish_txn is really small so publish_txn become very fast.In our test,
publish_txn cost about 10ms.

Co-authored-by: yixiutt <yixiu@selectdb.com>
2022-08-08 18:57:55 +08:00
c1c635e944 [Refactor](Nereids) Fix expression constant and improve SlotExtractor (#11513)
1. Fix expression constant and add unit test.
2. Improve logic in SlotExtractor and remove useless class IterationVisitor.
2022-08-08 17:36:21 +08:00
9349746987 [Fix](stream-load-json) fix VJsonReader::_write_data_to_column invalid column type cast when meet null (#11564)
column_ptr will be a none nullable column pointer after `column_ptr = &nullable_column->get_nested_column()`
so we should not cast column_ptr to ColumnNullable any more
2022-08-08 15:57:39 +08:00
6c065d3d59 [script](start_fe) support "--version" to show fe build info (#11563) 2022-08-08 15:55:01 +08:00
87f56914e9 [Improvement](debug message) add necessary info to DCHECK message (#11586) 2022-08-08 15:54:09 +08:00
411254c128 [Enhancement](hdfs) Support loading hdfs config from hdfs-site.xml (#11571) 2022-08-08 14:18:28 +08:00
37d1180cca [feature-wip](parquet-reader)decode parquet data (#11536) 2022-08-08 12:44:06 +08:00
Pxl
2cd3bf80dc [bugfix](schema change)fix core dump on vectorized_alter_table (#11538) 2022-08-08 10:45:28 +08:00
b93860902f [doc](tablet-health) modify content about tablet state (#11086)
Co-authored-by: caiconghui1 <caiconghui1@jd.com>
2022-08-08 10:43:13 +08:00
1e6a3610a7 [feature-wip](unique-key-merge-on-write) optimize rowid conversion and add ut (#11541) 2022-08-08 10:41:44 +08:00
e8a344b683 [feature-wip](parquet-reader) add predicate filter and column reader (#11488) 2022-08-08 10:21:24 +08:00
40b50400b2 [fix](doc) remove docs for direct compiling on Centos (#11575)
I tried to compile doris on Centos directly according to docs, however
it does not work. It is very difficult to find tools needed by doris
compilation on Centos.
2022-08-08 09:56:47 +08:00
6ea3465264 [improvement](doc)Description of bitmap type query result is null (#11506)
Description of bitmap type query result is null
2022-08-08 09:51:45 +08:00
4f5db35990 [fix](date) fix the value may be changed during the parsing of date and datetime types (#11573)
* [fix](date) fix the value may be changed during the parsing of date and datetime types
2022-08-08 08:58:30 +08:00
8802a41918 fix profile may cause query slow (#11386)
Co-authored-by: Wang Bo <wangbo36@meituan.com>
2022-08-07 20:52:52 +08:00
8b9d299472 [improvement](thirdparty) Build re2 with release mode (#11578) 2022-08-07 20:50:07 +08:00
7deebf7086 [doc](asf) update .asf.ymal to stop sending notification to dev@doris (#11574) 2022-08-07 20:31:24 +08:00
bd4048f8fb [enhancement](compaction) add idle schedule and max_size limit for base compaction (#11542)
Co-authored-by: yixiutt <yixiu@selectdb.com>
2022-08-07 16:21:57 +08:00
ee4d9d4347 [improvement](test) group some cases and group a case to p0 if it is not grouped (#11548) 2022-08-06 15:12:08 +08:00
683a1261c6 [Enhancement](vectorized) Runtime Filter support equivalent slot of outer join (#11530) 2022-08-06 08:10:28 +08:00
57b7a416d2 [chore](build) add apache snapshot maven repo to repositories (#11549) 2022-08-06 07:15:28 +08:00
3070318f95 [Enhancement](IdGenerator) Use IdGeneratorBuffer to get better performance for creating tablet in fe when do alter table job (#11524)
Co-authored-by: caiconghui1 <caiconghui1@jd.com>
2022-08-05 23:27:29 +08:00
574332bd6c [fix](nereids) revert tpch regession test (#11551)
Nereids tpch regression test is faulty, rollback first,and then add a more stable test later
2022-08-05 15:55:59 +08:00
52290fed90 [tools](tpch)update queries for better performance (#11523) 2022-08-05 14:04:26 +08:00
95753ec868 [feature](parquet-reader) add group filter util (#11533)
* [feature-wip](parquet-reader) add group filter util

Co-authored-by: jinzhe <jinzhe@selectdb.com>
2022-08-05 14:02:48 +08:00
d88d1239c5 [feature] (Nereids) support limit clause (#11209)
including:
1. limit clause parser
2. implementation rule to transform LogicalLimit to PhysicalLimit
2022-08-05 11:58:45 +08:00
321107cb40 [refactor](schema change) Using tablet schema shared ptr instead of raw ptr (#11475)
* Using tabletschema shared ptr instead of raw ptrs


Co-authored-by: yiguolei <yiguolei@gmail.com>
2022-08-05 11:04:38 +08:00
74340dac44 bitmap_from_string support uint64 string, but docs is wrong (#11510)
bitmap_from_string support uint64 string, but docs is wrong
2022-08-05 08:49:38 +08:00
6eb8ac0ebf [feature-wip][multi-catalog]Support caseSensitive field name in file scan node (#11310)
* Impl case sentive in file scan node
2022-08-05 08:03:16 +08:00
b5531c5caf [BugFix](BE) fix condition index doesn't match (#11474)
* [BugFix](Be) fix condition index doesn't match
2022-08-05 07:57:18 +08:00
092a394782 [improvement](agg)limit the output of agg node (#11461)
* [improvement](agg)limit the output of agg node
2022-08-05 07:53:55 +08:00
75fc830573 [Bug](date function) fix wrong year for format '%x' (#11520) 2022-08-05 06:22:22 +08:00
b6118acc19 [feature](functions) support add_months on vectorized engine (#11518) 2022-08-04 21:39:10 +08:00
d4e6e3edfd [bugfix]fix time accuracy (#11521) 2022-08-04 21:36:20 +08:00
e11024f5cc [enhancement](Nereids)set default join type to CROSS_JOIN (#11459)
set default join type to CROSS_JOIN on join that has no equal on condition when parse sql string.
2022-08-04 21:25:24 +08:00
6dc41d57f3 [enhancement](Nereids)support count, min and avg function (#11374)
1. add count function
2. add min function
3. add avg function
2022-08-04 21:19:32 +08:00
346fdeeee0 [fix](ut) Fix BE UT BetaRowsetTest failed (#11500) 2022-08-04 17:57:57 +08:00
aed0282046 [feature-wip](parquet-reader)get compressed parquet page data (#11493) 2022-08-04 17:44:52 +08:00
591b7f3f92 [multi-catalog](oss)Support hive external table on Ali oss. (#11489) 2022-08-04 17:44:34 +08:00
ef65fbb4f0 [tools](ssb-flat)update script for better performance (#11505) 2022-08-04 17:43:26 +08:00
165f0b5ac3 [fix](auditloader)add execute permission for fe_plugins/auditloader/build.sh (#11507)
Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>
2022-08-04 17:42:31 +08:00
95091256b0 [chore](deps) update bdbje tp doris bdbje, update libhdfs3 to improve performance (#11497) 2022-08-04 17:10:56 +08:00
c176ff54a5 [improvement](orc)improve hdfs scan performance (#11501)
Co-authored-by: root <root@bytedance>
2022-08-04 17:10:30 +08:00
9f221a703b [feature-wip](statistics) step5: show statistics job information (#8862)
This pull request includes some implementations of the statistics(https://github.com/apache/incubator-doris/issues/6370), it will not affect any existing code and users will not be able to create statistics job.

It implements the display of statistics job information, with the following syntax, users will be able to view the corresponding job information.

syntax:
```
SHOW ANALYZE
    [TABLE | ID]
    [
        WHERE
        [STATE = ["PENDING"|"SCHEDULING"|"RUNNING"|"FINISHED"|"FAILED"|"CANCELLED"]]
    ]
    [ORDER BY ...]
    [LIMIT limit][OFFSET offset];
```

e.g.
| id    | create_time             | start_time              | finish_time             | error_msg | scope               | progress | state    |
| ----- | ----------------------- | ----------------------- | ----------------------- | --------- | ------------------- | -------- | -------- |
| 60051 | 2022-07-21 01:26:26.173 | 2022-07-21 01:26:26.186 | 2022-07-21 01:26:27.104 |           | table1(citycode,pv) | 5/5      | FINISHED |
2022-08-04 16:10:49 +08:00
397bf354db [WIP](optional) using hash set to distinct single value (#11246)
* [WIP](optional) using hash set to distinct single value


Co-authored-by: wangbo36@meituan.com <wangbo36@meituan.com>
2022-08-04 15:52:58 +08:00
9078ab4d24 [feature](FE): add new property to control whether use light schema change or not (#11169) 2022-08-04 15:49:05 +08:00
Pxl
ec3c911f97 [Feature][Materialized-View] support materialized view on vectorized engine (#10792) 2022-08-04 14:07:48 +08:00
35f2632c3a [doc]fix-doc (#11439)
Update get-starting.md
2022-08-04 14:00:43 +08:00