Commit Graph

8276 Commits

Author SHA1 Message Date
86e47650cf Update outfile.md (#13172) 2022-10-08 20:01:20 +08:00
4386f41442 sql server 2017 version ODBC usage instructions (#13178)
sql server 2017 version ODBC usage instructions
2022-10-08 20:00:53 +08:00
6b0410450b [typo](docs)Fix jump link 404 in external storage load.md (#13173) 2022-10-08 19:59:44 +08:00
c5f802b93c [Bug](libjvm) reorder initialization of JNI (#13165) 2022-10-08 18:53:47 +08:00
b81a8789c3 [feature-wip](parquet-reader) optimize the performance of column conversion (#13122)
Convert Parquet column into doris column via batch method.
In the previous implementation, only numeric types can be converted in batches,
and other types can only be inserted one by one.
This process will generate repeated virtual function calls and container expansion.
2022-10-08 18:03:10 +08:00
5214e898d9 [fix](parquet-reader) skip data/datatime column predicate filter to avoid coredump (#13072)
Will be fixed later
Co-authored-by: jinzhe <jinzhe@selectdb.com>
2022-10-08 18:02:35 +08:00
cf2b93532b [fix](file-scanner) fix some logic about broker load with parquet with new file scanner (#13135)
Fix some logic about broker load using new file scanner, with parquet format:

1. If columns are specified in load stmt, but none of them are in parquet file,
    error will be thrown like `err: No columns found in file`. See `parquet_s3_case4`

2. If the first column of table are not in table, the result number of rows is wrong.
    See `parquet_s3_case8`

3. If column specified in `columns` in load stmt does not exist in file and table,
    error will be thrown like: `failed to find default value expr for slot: x1`. See `parquet_s3_case2`
2022-10-08 13:08:08 +08:00
63f5dc1953 [feature](Nereids): support Alias join reorder and fix bug. (#12890)
* [improve](Nereids): simplify onCondition check.

* feature: support project Alias for join reorder.
2022-10-08 10:45:04 +08:00
91cf33865d [improvement](load) config flush_thread_num_per_store to be 6 by default (#13076)
Flushing memtable is cpu bound, so 2 thread for a disk is tool small.
2022-10-08 09:16:22 +08:00
e0f17f217f [fix](test) resolve tpch_sf100_unique_p2 and tpch_sf10_unique_p2 to run in parallel (#13138) 2022-10-08 09:10:22 +08:00
71399ed771 fix data cache sidebar error (#13137)
fix data cache sidebar error
2022-10-07 17:45:21 +08:00
d902e80d6d [docs](unique-key-merge-on-write) add document for unique key merge o… (#13068) 2022-10-07 16:18:04 +08:00
8b03977689 fix bug that last line of data lost for stream load when line delimiter is more than one character (#13066) 2022-10-07 16:12:05 +08:00
447aceb223 [Fix](doc) Remove unsupported parameter (#13081) 2022-10-07 16:10:00 +08:00
b41748efa1 [feature-wip](new-scan)Add new jdbc scanner and new jdbc scan node (#12848)
Related pr: #11582
This pr is the new jdbc scan node and scanner.
2022-10-07 09:55:17 +08:00
0ccb047d45 fix slack link (#13128) 2022-10-06 18:11:14 +08:00
f2aa6e9a21 [doc](typo): fix typo (#13130) 2022-10-06 18:10:41 +08:00
441b450a79 (runtimefilter) shorter time prepare consumes (#13127)
Now, every preare put a runtime filter controller, so it takes the
mutex lock on the controller map. Init of bloom filter takes some
time in allocate and memset. If we run p1 tests with -parallel=20
-suiteParallel=20 -actionParallel=20, then we get error message like
'send fragment timeout 5s'.

The patch fixes the problem in the following 2 ways:
1. Replace one mutex block with 128s.
2. If a plan fragment does not have a runtime filter, it does not need to take
the locks.
2022-10-06 10:12:29 +08:00
218b0857ab [fix](string) allocate memory according to actual size instead of max size (#13112)
String column lengh is 2GB, if we allocate memory according to column length,
string would consume a lot of memory. It also misleads memory tracker.
2022-10-06 09:56:22 +08:00
d286aa7bf7 [fix](spark-load) no need to filter row group when doing spark load (#13116)
1. Fix issue #13115 
2. Modify the method of `get_next_block` or `GenericReader`, to return "read_rows" explicitly.
    Some columns in block may not be filled in reader, if the first column is not filled, use `block->rows()` can not return real row numbers.
3. Add more checks for broker load test cases.
2022-10-05 23:00:56 +08:00
90512ebd59 [typo](docs)Metadata Operations and Maintenance link error (#13090)
* Metadata Operations and Maintenance link error
2022-10-05 22:58:24 +08:00
7b75c2df54 [fix](BE) fix the stream load error when upgrade BE from 1.1.2 to master (#13058) 2022-10-05 12:13:26 +08:00
80e1f401f0 [enhancement](memory) Fix USE_MEM_TRACKER=OFF compile (#13085) 2022-10-05 12:10:49 +08:00
4a0b4f1836 [fix](fe-test) TestWithFeService do not clean up dorisHome (#13073) 2022-10-04 21:32:27 +08:00
b083fb6d5f [fix](decimal) retain Decimal trailing zero when select on fe (#13065) 2022-10-04 21:31:18 +08:00
74fc98ceeb [improvement](ResourceTag) support upper case in tag name (#13063) 2022-10-04 21:30:37 +08:00
984d387945 [Regression](load) Add broker load regression test. (#13062)
Add basic broker load regression test. It has been tested. But default
2022-10-04 21:29:05 +08:00
e00124d825 [typo](doc) Modify the comment of light schema change (#13061) 2022-10-04 21:28:11 +08:00
0c67b14b6d [typo](doc) replace unuse parameter max_base_compaction_concurrency (#13047) 2022-10-04 21:27:38 +08:00
3f47f67b16 [fix](parquet) fix parquet write setting property is not effective (#12912) 2022-10-04 21:25:57 +08:00
e167aa120f [fix](jdbc) fix insert into date type to oracle using wrong type (#12883)
using JDBC insert into date type to ORACLE,
it's should be use to_date function convert string to java.sql.date
2022-10-04 21:24:33 +08:00
5092ef78da [doc] Add python env for Mac M1 (#12792)
For Mac M1, the default is python3 instead of python.
When FE compiles, there will be an error that python cannot be found.
This PR complements this part of the description.
2022-10-04 21:24:08 +08:00
d10ab474f4 [fix](test) try to let cases run in parallel (#13114) 2022-10-04 20:56:22 +08:00
Pxl
db89b0b703 [Enhancement](optimize) optimize for function multiply on decimalv2 (#13049)
optimize for function multiply on decimalv2
2022-10-04 16:07:18 +08:00
0dd2fb758c [fix](test) add sync and drop table for insert.groovy and test_array_load.groovy (#13105)
We need sync for multi fe env.
2022-10-04 10:24:38 +08:00
b53533408b not allow alter mow property (#13108) 2022-10-03 21:31:09 +08:00
026ffaf10d [feature-wip](parquet-reader) add detail profile for parquet reader (#13095)
Add more detail profile for ParquetReader:
ParquetColumnReadTime: the total time of reading parquet columns
ParquetDecodeDictTime: time to parse dictionary page
ParquetDecodeHeaderTime: time to parse page header
ParquetDecodeLevelTime: time to parse page's definition/repetition level
ParquetDecodeValueTime: time to decode page data into doris column
ParquetDecompressCount: counter of decompressing page data
ParquetDecompressTime: time to decompress page data
ParquetParseMetaTime: time to parse parquet meta data
2022-10-02 15:11:48 +08:00
8b14c4aa98 [fix](compaction) don't log cumu policy name for quick compaction (#13101) 2022-10-01 21:40:42 +08:00
6fb9337095 [fix](test) add sync for some cases and adjust data path for tpch_unique_sql_zstd_p0 (#13102) 2022-10-01 21:26:50 +08:00
e9809b5721 [fix](test) add tpch_sf100 and fix results of tpcds_sf100 (#13098) 2022-10-01 20:53:04 +08:00
d44af5decf [fix](alter-load) fix bug that tablet version may be wrong when doing alter and load (#13070)
the `isRunning()` method of `TransactionState` is missing `PRE_COMMITTED` status.
Which cause wrong judgment of `isPreviousTransactionsFinished`
2022-09-30 23:39:30 +08:00
48d32de9ae [enhancement](test) add some cases from trino to p0 (#12699) 2022-09-30 21:35:30 +08:00
fd52f3bd51 [Doc](ReadME) Update the slack links (#13089) 2022-09-30 20:50:37 +08:00
95561baddd [fix](planner) throw NPE when all group by expr is constant and no agg expr in select list (#13087) 2022-09-30 18:47:01 +08:00
3294b18674 [Improvement](datev2) fix some compatible problems for datev2 (#13079) 2022-09-30 13:56:01 +08:00
e7f18e998a [chore](be-ut) Remove useless lines which cause compilation errors (#13053) 2022-09-30 11:26:25 +08:00
d73e437718 [fix](array-type) fix the be core dump when use string to insert array (#12728)
Co-authored-by: hucheng01 <hucheng01@baidu.com>
2022-09-30 10:44:27 +08:00
90f11ed7c1 [enhancement](Nereids) remove unnecessary exchange between global and distinct local aggregate node (#13057)
Add partition info into LogicalAggregate and set it as original group expression list of aggregate when we do aggregate disassemble with distinct aggregate function.
2022-09-29 23:12:37 +08:00
31a23baa37 [fix](planner) Add default execution interval time for stats framework (#13044)
Set a default execution interval for stats collection related threads.
2022-09-29 22:40:27 +08:00
7aae98eb71 [fix](comment) sparkload comment mislead which file types it support (#12982) 2022-09-29 20:23:57 +08:00