doris

Author	SHA1	Message	Date
Mingyu Chen	539efb3532	Revert "[Fix]SlotRef.tosql() is the same as the SQL returned by different sql" (#3610 ) This revert is used to correct the mess of the commit timeline caused by the wrong merge method.	2020-05-18 13:07:21 +08:00
Mingyu Chen	20f20239f2	Revert "[Bug] fix OrCompoundPredicate predicate fold bug #3596 " (#3609 ) This revert is used to correct the mess of the commit timeline caused by the wrong merge method.	2020-05-18 13:03:24 +08:00
Mingyu Chen	ed6548e27f	Revert "[Bug] Fix bug that descriptor table is not reset before planning next routine load task (#3605 )" (#3608 ) This reverts commit 271f25f0a4e98c3d9130c0772bc386e7786cbae4. This revert is used to correct the mess of the commit timeline caused by the wrong merge method.	2020-05-18 13:00:20 +08:00
Mingyu Chen	24ca937877	Revert "[Doc] Update table-restore-tool.md" (#3606 )	2020-05-18 12:08:54 +08:00
Mingyu Chen	0d76c78537	[Doc] Update table-restore-tool.md	2020-05-18 11:12:24 +08:00
Mingyu Chen	bb7ae97845	[trace] Introduce trace util to BE Ref https://github.com/apache/incubator-doris/issues/3566 Introduce trace utility from Kudu to BE. This utility has been widely used in Kudu, Impala also import this trace utility. This trace util is used for tracing each phases in a thread, and can be dumped to string to see each phases' time cost and diagnose which phase cost more time. This util store a Trace object as a threadlocal variable, we can add trace entries which record the current file name, line number, user specified symbols and timestamp to this object, and it's able to add some counters to this Trace object. And then, it can be dumped to human readable string. There are some helpful macros defined in trace.h, here is a simple example for usage: ``` scoped_refptr<Trace> t1(new Trace); // New 2 traces scoped_refptr<Trace> t2(new Trace); t1->AddChildTrace("child_trace", t2.get()); // t1 add t2 as a child named "child_trace" TRACE_TO(t1, "step $0", 1); // Explicitly trace to t1 usleep(10); // ... do some work ADOPT_TRACE(t1.get()); // Explicitly adopt to trace to t1 TRACE("step $0", 2); // Implicitly trace to t1 { // The time spent in this scope is added to counter t1.scope_time_cost TRACE_COUNTER_SCOPE_LATENCY_US("scope_time_cost"); ADOPT_TRACE(t2.get()); // Adopt to trace to t2 for the duration of the current scope TRACE("sub start"); // Implicitly trace to t2 usleep(10); // ... do some work TRACE("sub before loop"); for (int i = 0; i < 10; ++i) { TRACE_COUNTER_INCREMENT("iterate_count", 1); // Increase counter t2.iterate_count MicrosecondsInt64 start_time = GetMonoTimeMicros(); usleep(10); // ... do some work MicrosecondsInt64 end_time = GetMonoTimeMicros(); int64_t dur = end_time - start_time; // t2's simple histogram metric with name prefixed with "lbm_writes" const char* counter = BUCKETED_COUNTER_NAME("lbm_writes", dur); TRACE_COUNTER_INCREMENT(counter, 1); } TRACE("sub after loop"); } TRACE("goodbye $0", "cruel world"); // Automatically restore to trace to t1 std::cout << t1->DumpToString(Trace::INCLUDE_ALL) << std::endl; ``` output looks like: ``` 0514 02:16:07.988054 (+ 0us) trace_test.cpp:76] step 1 0514 02:16:07.988112 (+ 58us) trace_test.cpp:80] step 2 0514 02:16:07.988863 (+ 751us) trace_test.cpp:103] goodbye cruel world Related trace 'child_trace': 0514 02:16:07.988120 (+ 0us) trace_test.cpp:85] sub start 0514 02:16:07.988188 (+ 68us) trace_test.cpp:88] sub before loop 0514 02:16:07.988850 (+ 662us) trace_test.cpp:101] sub after loop Metrics: {"scope_time_cost":744,"child_traces":[["child_trace",{"iterate_count":10,"lbm_writes_lt_1ms":10}]]} ``` Exclude the original source code, this patch do the following work to adapt to Doris: - Rename "kudu" namespace to "doris" - Update some names to the existing function names in Doris, i.g. strings::internal::SubstituteArg::kNoArg -> strings::internal::SubstituteArg::NoArg - Use doris::SpinLock instead of kudu::simple_spinlock which hasn't been imported - Use manual malloc() and free() instead of kudu::Arena which hasn't been imported - Use manual rapidjson::Writer instead of kudu::JsonWriter which hasn't been imported - Remove all TRACE_EVENT related unit tests since TRACE_EVENT is not imported this time - Update CMakeLists.txt	2020-05-18 11:10:25 +08:00
chenmingyu	d4ff6dcdd6	fix by review	2020-05-18 10:56:12 +08:00
Mingyu Chen	2f3b7b5b8e	[Refactor] Refactor some redundant code && Replace some UT by UtFrameUtils	2020-05-18 10:53:32 +08:00
Mingyu Chen	bca9fb8551	[Bug] Fix bug that ConcurrentModificationException thrown When truncate the table, a ConcurrentModificationException may thrown when there are temp partitions in this table.	2020-05-18 10:48:19 +08:00
Mingyu Chen	5276b5a4a3	[Bug] Fix bug that DbTxnMgr does not create for db in CatalogRecycleBin Fix #3589 The reason is that: When load meta from image, we will create `DatabaseTransactionMgr` for each database loaded from `loadDb()` method. But we forget to create `DatabaseTransactionMgr` for database in the catalog recycle bin.	2020-05-18 10:42:17 +08:00
Mingyu Chen	62f746fc87	[Fix] SlotRef.tosql() is the same as the SQL returned by different sql	2020-05-18 10:41:15 +08:00
Mingyu Chen	e6588981b4	[Bug] fix OrCompoundPredicate predicate fold bug #3596 (#3597 ) * [Bug] fix OrCompoundPredicate predicate fold bug * fix code style	2020-05-18 10:36:13 +08:00
Mingyu Chen	271f25f0a4	[Bug] Fix bug that descriptor table is not reset before planning next routine load task (#3605 ) Before planning for next routine load task, the analyzer and descriptor table in it should be reset. Otherwise, a lot of historical objects will accumulate inside, causing memory leaks.	2020-05-18 10:34:21 +08:00
wutiangan	5138197d57	[Bug] generate exceptions to avoid mulitDistinctAggregation produces wrong results (#3561 ) when a query (#3492) contain “2 DistinctAggregation with one column” and “1 DistinctAggregation with two columns”, it will produce wrong result. This pull request is not to solve this problem really, but to generate exceptions to avoid getting wrong results. This problem needs a real repair in future.	2020-05-16 21:36:43 +08:00
HappenLee	7bf926eba8	[Profile] Improve the running profile 1. Delete Invalid Counter In Data_Stream_Sender. (#3598) 2. Add Counter For PartitionHashTable of PartitionAggregationNode: * Hash Probe Method * Row processed by Aggregation * HashFilledBuckets: Counter How Many FilledBuckets in Aggragation * HTResize: Counter How Many Resize of HashTable * HashProbe: Counter Probe of HashTable * HashFailedProbe: Counter Failed Probe of HashTable * HashTravelLength: Total TravelLength for Probe * HashCollisions: Counter of HashCollision 3. Del some unecessary code in PartitionHashTable by template	2020-05-16 21:35:30 +08:00
morningman	8cb48161e3	change to current catalog	2020-05-16 21:12:46 +08:00
marising	4217db00d3	Tosql method returns slot index and column name	2020-05-15 17:31:25 +08:00
morningman	c50b1a4d17	fix bug	2020-05-15 16:15:53 +08:00
turbo jason	0d457692bc	[incubator-doris][thirdpary][glog][bug] Calucate file length at the be start (#3594 )	2020-05-15 15:15:54 +08:00
hffariel	a4e98953be	[website] modify download links & remove some links' suffix `_EN`(master) (#3573 ) modify download links & remove some links' suffix _EN	2020-05-15 14:03:28 +08:00
yangzhg	a7e1c08624	Report error when subquery in case-when returns empty set (#3558 ) The doris rewrite the subquery in case-when to inline view. So it the result is different between subquery in case-when and inline view. We could not support the empty set of subquery in case-when. This commit forbidden this case.	2020-05-15 12:32:05 +08:00
wutiangan	8be10dca05	fix code style	2020-05-15 12:10:19 +08:00
wangcong18	805ecc9d4e	fix	2020-05-15 11:23:01 +08:00
wutiangan	0919407092	[Bug] fix OrCompoundPredicate predicate fold bug	2020-05-15 10:20:13 +08:00
Dayue Gao	273aad6cf4	[Bug] Restore tablet action not working because tablet status is shutdown (#3551 )	2020-05-15 10:11:17 +08:00
yangzhg	123e1394b1	[Delete] Allow delete duplicated non-key column using delete from (#3424 )	2020-05-15 09:26:36 +08:00
Yingchun Lai	9fc2554e6c	indentation	2020-05-14 14:45:22 +00:00
yangzhg	4464328d8f	[Doc] Add doc link to char_length (#3548 )	2020-05-14 21:21:31 +08:00
morningman	f162596d32	fix bug	2020-05-14 16:13:39 +08:00
morningman	fc02ce8034	tmp_partition_qa1	2020-05-14 15:21:05 +08:00
wutiangan	9f224cdd8a	[Bug] Fix bug of Partition prune of constant in predicate (#3476 ) 1. phenomenon： The following two statements are the same, but a query has results and the other query has no results mysql> select * from (select '积极' as kk1, sum(k2) from table_range where k1 = '2013-01-01' group by kk1)tt where kk1 = '积极'; +--------+-----------+ \| kk1 \| sum(`k2`) \| +--------+-----------+ \| 积极 \| 1 \| +--------+-----------+ 1 row in set (0.01 sec) mysql> select * from (select '积极' as kk1, sum(k2) from table_range where k1 = '2013-01-01' group by kk1)tt where kk1 in ('积极'); Empty set (0.01 sec) 2. reason： In partition prune, constant in predicate（‘积极’ in ‘积极’） is mistakenly considered to meet partition prune conditions, and mistakenly regarded as partition prune column. Then in partition prune , no partition is considered to meet the requirements, so it is planned to be 0 partition in query planning	2020-05-14 11:46:13 +08:00
yangzhg	d9e455124a	[FIX] fix some doris web page dispaly error (#3544 )	2020-05-14 11:38:42 +08:00
morningman	b08e08b3ba	first	2020-05-14 09:25:51 +08:00
hffariel	47bce081d2	[website] Support documents' fulltext searching (master) (#3535 ) add documents' fulltext search powered by algolia	2020-05-13 21:18:42 +08:00
Zhao Chun	95c67db712	[community] Add Committer Guide (#3522 )	2020-05-13 21:17:12 +08:00
Yingchun Lai	8406723912	adapt to Doris	2020-05-13 12:13:47 +00:00
Yingchun Lai	e066791e47	import original files	2020-05-13 19:03:20 +08:00
Mingyu Chen	40cd5365ce	[Doc] Update table-restore-tool.md Fix some format.	2020-05-13 18:51:11 +08:00
Binglin Chang	a7cfafe076	[Memory Engine] add core column related classes (#3508 ) add core column related classes	2020-05-13 16:30:32 +08:00
Mingyu Chen	54e38ecda2	[Bug] Fix bug of transaction manager (#3565 ) Fix bug of using wrong `abortTransaction()` method	2020-05-13 15:45:15 +08:00
marising	f9f3a84e9d	fixed bug:SlotRef.tosql() is the same as the SQL returned by different SQL	2020-05-13 15:06:44 +08:00
Mingyu Chen	ca7c0717cd	Fix compile bug (#3557 )	2020-05-12 10:24:37 +08:00
caiconghui	b648734441	[TxxMgr] Support txn management in db level and use ArrayDeque to improve txn task performance (#3369 ) This PR is the first step to make Doris stream load more robust with higher concurrent performance(#3368)，the main work is to support txn management in db level isolation and use ArrayDeque to stored final status txns.	2020-05-11 23:32:43 +08:00
WingC	4294301c53	Throw DdlException when use `admin set frontend config` (#3539 ) The set more than one config in a single set config stmt, an exception will be thrown to forbid the operation.	2020-05-11 23:29:38 +08:00
Binglin Chang	63fecc7954	Remove unused ColumnType (#3532 )	2020-05-11 18:57:47 +08:00
令狐少侠	5a57ecca15	[Doris On ES]fix bug of query failed in doc_value_mode when fields have none value (#3513 ) #3479 Here I try to explain the cause of the problem and how to fix it. The Cause of The problem Take the case in issue(#3479 ) as an example: The general results are as follows: ``` GET table/_doc/_search {"query":{"match_all":{}},"stored_fields":"_none_","docvalue_fields":["k1"],"sort":["_doc"],"size":100} { "took": 6, "timed_out": false, "_shards": { …… }, "hits": { "total": 3, "max_score": null, "hits": [ { "_index": "table", "_score": null, "sort": [ 0 ] }, { "_index": "table", "_score": null, "fields": { "k1": [ "kkk1" ] }, "sort": [ 0 ] }, { "_index": "table", "_score": null, "sort": [ 0 ] } ] } } ``` But in Doris on ES，Be fetched data parallelly on all shards, and use `filter_path` to reduce the network cost. The process will be as follows: ``` GET table/_doc/_search?preference=_shards:1&filter_path=_scroll_id,hits.hits._source,hits.total,_id,hits.hits._source.fields,hits.hits.fields {"query":{"match_all":{}},"stored_fields":"_none_","docvalue_fields":["k1"],"sort":["_doc"],"size":100} { "hits": { "total": 0 } } GET table/_doc/_search?preference=_shards:2&filter_path=_scroll_id,hits.hits._source,hits.total,_id,hits.hits._source.fields,hits.hits.fields {"query":{"match_all":{}},"stored_fields":"_none_","docvalue_fields":["k1"],"sort":["_doc"],"size":100} { "hits": { "total": 1 } } GET table/_doc/_search?preference=_shards:3&filter_path=_scroll_id,hits.hits._source,hits.total,_id,hits.hits._source.fields,hits.hits.fields {"query":{"match_all":{}},"stored_fields":"_none_","docvalue_fields":["k1"],"sort":["_doc"],"size":100} { "hits": { "total": 1, "hits": [ { "fields": { "k1": [ "kkk1" ] } } ] } } ``` Scan-Worker On BE which processed result of shard2 will failed. The reasons are as follows: 1. "filter_path" causes the hits.hits object not exist. 2. In the current implementation, if there are some data rows（total > 0）, the hits.hits. object must be an array How To Fix it Two Method: 1. modify "filter_path" to contain the hits. Pros: Fixed Code is very simple Cons: More network cost 2. Deal with the case where fields are missing in a batch. Pros: No loss of performance Cons: Code is more complex Performance first, I use Method2. Design 1. Add a variable "_doc_value_mode" into Class "EsScrollParser" to =indicate whether the data processed by this parser is doc_value_mode or not. 2. "_doc_value_mode" is passed from ESScollReader <- ESScanner <- ScrollQueryBuilder::build() that determines whether DSL is enable doc_value_mode 3. When hits.hits of response from ES is empty and total > 0. We know there are data lines, but the corresponding fields do not exist. EsScrollParser will use "_doc_value_mode" and _total to construct _total lines which fields are assigned with 'NULL'	2020-05-11 15:34:12 +08:00
WingC	edbeaf8e30	Throw a UserException when miss plugin's md5 file (#3542 )	2020-05-11 15:33:35 +08:00
HuangWei	57cbfb772d	Add -Werror when gcc<=7.3.0 & udf fix (#3533 )	2020-05-11 10:31:38 +08:00
Yingchun Lai	b576e54fe6	[ASAN] Fix some address problems detected by ASAN (#3495 ) LSAN detected errors have been fixed by a prior pathch (#3326), but there are still some ASAN detected errors. This patch try to fix these errors to make Doris BE more robustness. And then we can add CI run in LSAN/ASAN mode to detect memory errors as early as possible.	2020-05-11 10:30:45 +08:00
Lijia Liu	561765fc08	Identify old empty tablet when add tablet to meta in ReportHandler (#3547 )	2020-05-11 09:50:43 +08:00

... 327 328 329 330 331 ...

18263 Commits