doris

Author	SHA1	Message	Date
Pxl	be124523f4	[enhancement](profile) add profile to show column predicates (#13862 )	2022-11-02 09:07:26 +08:00
starocean999	277025b046	[fix](join)ColumnNullable need handle const column with nullable const value (#13866 )	2022-11-02 08:52:49 +08:00
yiguolei	de1dc62843	[enhancement](olap scanner) Scanner row bytes buffer is too small bug (#13874 ) * [enhancement](olap scanner) Scanner row bytes buffer is too small, please try to increase be config Co-authored-by: yiguolei <yiguolei@gmail.com>	2022-11-02 08:41:50 +08:00
Yulei-Yang	3924ecead5	[minor](load) Improve error message for string type in loading process (#13718 )	2022-11-01 22:02:33 +08:00
Yongqiang YANG	8b3afd431e	[improvement](memory) simplify memory config related to tcmalloc (#13781 ) There are several configs related to tcmalloc, users do know how to config them. Actually users just want two modes, performance or compact, in performance mode, users want doris run query and load quickly while in compact mode, users want doris run with less memory usage. If we want to config tcmalloc individually, we can use env variables which are supported by tcmalloc.	2022-11-01 21:45:19 +08:00
Gabriel	287a739510	[javaudf](string) Fix string format in java udf (#13854 )	2022-11-01 21:25:12 +08:00
Lightman	f30b974d54	[Bugfix](upgrade) Fix 1.1 upgrade 1.2 coredump when schema change (#13822 ) When upgrade 1.2 version from 1.1, FE version will don't match BE version for a period of time. After upgrade BE and doing schema change, BE will use a field desc_tbl that add in 1.2 version FE. BE will coredump because the field desc_tbl is nullptr. So it need to refuse the request.	2022-11-01 17:35:24 +08:00
TengJianPing	c14277e587	[fix](analytic) fix coredump cause by empty analytic parameter types (#13808 ) * fix fe compile error	2022-11-01 17:25:36 +08:00
Mingyu Chen	942611c185	Revert "[enhancement](compaction) opt compaction task producer and quick compaction (#13495 )" (#13833 ) This reverts commit 4f2ea0776ca3fe5315ab5ef7e00eefabfb5771a0.	2022-11-01 14:22:12 +08:00
AlexYue	7db916fc85	[enhancement](metric)Add metric for exec_state prepare function (#13646 ) * add bvar metric for exec_state prepare function	2022-11-01 14:09:47 +08:00
Gabriel	42b2725f03	[Bug](delete) Fix wrong delete operation (#13840 )	2022-11-01 13:38:43 +08:00
Pxl	164ca1e1a8	[Bug](function) change log fatal to log warning to avoid code dump on nullable double column cast to decimal column (#13819 )	2022-11-01 09:54:35 +08:00
carlvinhust2012	cc0fa5fef6	[fix](array-type) fix the be core dump when import array<largeint> (#13821 ) - this pr is used to fix the be core dump when import array. - before the change, we import array by rapidjson string will core dump under the non-vectorized scenario. - after the change, we can import array by rapidjson string successfully.	2022-10-31 22:08:55 +08:00
Pxl	57a9b0fa65	[Enhancement](chore) remove unused diagnostic (#12337 ) remove unused diagnostic	2022-10-31 19:19:13 +08:00
Kang	7ae60a0ad2	[feature](function)add url functions: domain and protocol (#13662 )	2022-10-31 19:13:08 +08:00
Mingyu Chen	2fb218173e	[improvement](scan) change the max thread num and num of free blocks in new scan (#13793 ) 1. In the previous implementation, the max thread num of olap scanner was set relatively small, such as 3. which would slow down some of queries. In this PR, I changed the max thread num to a quarter of the scaner thread pool(default is 12), which is less than the old scan node's max thread num, but larger than the previous implementation. The upper limit of the max thread num of the old scan node is too high, which is not reasonable. 2. Lower down the number of pre allocated free blocks.	2022-10-31 14:00:06 +08:00
yixiutt	4f2ea0776c	[enhancement](compaction) opt compaction task producer and quick compaction (#13495 ) 1.remove quick_compaction's rowset pick policy, call cu compaction when trigger quick compaction 2. skip tablet's compaction task when compaction score is too small Co-authored-by: yixiutt <yixiu@selectdb.com>	2022-10-31 12:24:05 +08:00
TengJianPing	2b9e1878a2	[fix](hashjoin) return error if in progress of upgrade (#13753 )	2022-10-31 09:41:20 +08:00
Pxl	711dad28fb	[Chore](unused) remove QSorter #13769	2022-10-31 08:44:39 +08:00
HappenLee	b15e0a9fb5	[Bug](function) fix bug of if function of nullable column process (#13779 )	2022-10-31 08:38:53 +08:00
Xinyi Zou	9f7c76a0d6	[fix](memtracker) Fix the usage of bthread mem tracker (#13708 ) bthead context init has performance loss, temporarily delete it first, it will be completely refactored in #13585.	2022-10-30 19:51:00 +08:00
Ashin Gau	e0667b297f	[feature-wip](multi-catalog) reuse hdfsFs and decode parquet values in batch (#13688 ) PR(https://github.com/apache/doris/pull/13404) introduced that ParquetReader will break up batch insertion when encountering null values, which leads to the bad performance compared to OrcReader. So this PR has pushed null map into decode function, reduce the time of virtual function call when encountering null values. Further more, reuse hdfsFS among file readers to reduce the time of building connection to hdfs.	2022-10-28 15:52:52 +08:00
pengxiangyu	eab8876abc	[Feature](remote) Using heavy schema change if the table is not enable light weight schema change (#13487 )	2022-10-28 15:48:22 +08:00
Pxl	2fab0c45c7	[Feature](runtime-filter) add runtime filter breaking change adapt (#13246 ) add runtime filter breaking change adapt	2022-10-28 10:59:28 +08:00
Jerry Hu	5805011629	[Feature](string-function) Add function mask/mask_first_n/mask_last_n (#13694 ) Implementation of mask function from hive.	2022-10-28 10:43:56 +08:00
HappenLee	d6b72d9b89	[Bug](update) support to check optional value of agg_sort_infos (#13732 )	2022-10-28 10:37:13 +08:00
liyipingbest	a8a91a827a	[fix] Fix the variable of boost_ROOT ,BOOST_ROOT will not work (#13450 ) When execute shell command bash build.sh --be to build the backend, the cmake tool will show can't find the boost library, because the variable of BOOST_ROOT has some spelling mistake. OS: Ubuntu 22.04 x86_64 CMake: 3.22.1 compiler: gcc (Ubuntu 11.2.0-7ubuntu2) 11.2.0	2022-10-28 08:46:35 +08:00
Adonis Ling	2ef8f3f6f4	[enhancement](java-udf) Support loading libjvm at runtime (#13660 )	2022-10-28 08:45:12 +08:00
AlexYue	20363edc73	[BugFix](function) fix reverse function dynamic buffer overflow due to illegal character (#13671 ) Previous logic of reverse function might not be strong enough to handle illegal character. For example, one one byte size character would be mistaken as one utf-8 character which occupies more than one byte space. And unfortunately exceeding the buffer space during future process.	2022-10-28 08:44:08 +08:00
TengJianPing	859ffa6304	[bugfix](concat) be crash caused by function concat(ifnull) (#13693 )	2022-10-28 08:42:51 +08:00
lsy3993	c108554f14	[function](date function) add new date function 'to_monday' #13707	2022-10-28 08:41:16 +08:00
Adonis Ling	f51464af59	[chore](macOS) Support Java UDF (#13714 )	2022-10-28 08:40:56 +08:00
zhangstar333	5dd052d386	[Function](array) support array_range function (#13547 ) * array_range with 3 impl * [Function](array) support array_range function * update * update code	2022-10-28 08:40:24 +08:00
zhangstar333	43c6428aea	[Function](string) support sub_replace function (#13736 ) * [Function](string) support sub_replace function * remove conf	2022-10-28 08:40:08 +08:00
carlvinhust2012	36053d2419	[fix](array-type) fix the be core dump when select the invalid array format (#13514 ) 1. this pr is used to fix the be core dump when select the invalid array. 2. before the change, we run "select array_intersect([1, 2, 3, 1, 2, 3], '1[3, 2, 5]');" will cause be core dump. MySQL [example_db]> select array_intersect([1, 2, 3, 1, 2, 3], '1[3, 2, 5]'); ERROR 1105 (HY000): RpcException, msg: io.grpc.StatusRuntimeException: UNAVAILABLE: Network closed for unknown reason 3. after the change, we run "select array_intersect([1, 2, 3, 1, 2, 3], '1[3, 2, 5]');" will get error message. MySQL [example_db]> select array_intersect([1, 2, 3, 1, 2, 3], '1[3, 2, 5]'); errCode = 2, detailMessage = No matching function with signature: array_intersect(array<tinyint(4)>, varchar(-1))" Co-authored-by: hucheng01 <hucheng01@baidu.com>	2022-10-27 23:11:12 +08:00
Adonis Ling	bad950136d	[chore](build) Pass the compile flag -Wno-unused-but-set-variable on demand (#13716 ) There are some issues with the compile flag `-Wno-unused-but-set-variable` for clang. 1. `-Wno-unused-but-set-variable` should be set when building source by clang-15 on Linux. (#13000 #13016) 2. On macOS Monterey, Apple Clang 13 may treat it as a unknown warning option and the compilation process may interrupt. This PR introduces a better way to make this compile flag more portable. 1. Test whether the compiler recognizes this flag. 2. Add this flag if the compiler recognizes it.	2022-10-27 15:18:28 +08:00
camby	738da0b139	[bugfix](join) inner join return wrong result (#13608 ) * bug fix for vhash join * add regression test Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-10-27 11:48:41 +08:00
zhannngchen	d388de6c11	[Enhancement](threadpool) print thread pool name on error (#13706 )	2022-10-27 10:49:18 +08:00
starocean999	c874931ac8	[fix](join)output all value from no-null side of outer join (#13655 ) * [fix](joinoutput all value from no-null side of outer join * add regression test	2022-10-27 10:48:36 +08:00
HappenLee	ffcb2f8525	[opt](exec) Replace get_utf8_byte_length function by array (#13664 )	2022-10-27 09:46:41 +08:00
Gabriel	3c95106d45	[Bug](jdbc) Fix memory leak for JDBC datasource (#13657 )	2022-10-27 00:02:25 +08:00
Gabriel	0134e9d2f4	[Improvement](runtime filter) Reduce merging time for bloom filter (#13668 )	2022-10-27 00:02:05 +08:00
awakeljw	06e433e14a	[fix](cmake)fix cmake error (#13637 ) fix cmake error if variables(${LIB_JVM}) is ""	2022-10-26 21:38:50 +08:00
Zhengguo Yang	65aa863dcf	[Bugfix](bitmap) Fix to_bitmap_with_check function symbol is incorrect (#13667 ) * [Bugfix](bitmap) Fix to_bitmap_with_check function symbol is incorrect	2022-10-26 14:27:38 +08:00
Tiewei Fang	c418bbd2d1	[feature-wip](new-scan) support Json reader (#13546 ) Issue Number: close #12574 This pr adds `NewJsonReader` which implements GenericReader interface to support read json format file. TODO: 1. modify `_scann_eof` later. 2. Rename `NewJsonReader` to `JsonReader` when `JsonReader` is deleted.	2022-10-26 12:52:21 +08:00
Jibing-Li	44c9163b3c	[Fix](multi-catalog)Fix partition external table query bug. (#13535 ) The index for external table columns from path is incorrect in new scanner. This is a fix for it. e.g. In the next query, nation and city columns are from path ``` mysql> select nation, city, count() from parquet_two_part group by nation, city; +--------+------------+----------+ \| nation \| city \| count() \| +--------+------------+----------+ \| cn \| beijing \| 1199969 \| \| cn \| shanghai \| 1199771 \| \| jp \| tokyo \| 599715 \| \| rus \| moscow \| 600659 \| \| us \| chicago \| 1199805 \| \| us \| washington \| 1201296 \| +--------+------------+----------+ 6 rows in set (0.39 sec) ```	2022-10-26 12:47:37 +08:00
Yongqiang YANG	295d887cf5	[improvement](thread) set name for priority thread pool (#13552 )	2022-10-26 09:32:15 +08:00
zhannngchen	2563dcca95	[fix](load) fix core dump when get_memtable_consumption_inflush (#13629 ) If delta writer is not inited, _flush_token might be nullptr.	2022-10-26 09:20:33 +08:00
huangzhaowei	17ba40f947	[feature-wip](CN Node)Support compute node (#13231 ) Introduce the node role to doris, and the table creation and tablet scheduler will control the storage only assign to the BE nodes.	2022-10-25 21:44:33 +08:00
HappenLee	2c70b17a47	[Del](vec) Support in predicate in delete condition of or and (#13587 )	2022-10-25 17:33:35 +08:00

1 2 3 4 5 ...

3083 Commits