doris

Files

Youngwb 650536d53e [Feature] Add Topn udaf (#4803)

For #4674 
This is a UDAF for approximate topN based on the Space-Saving algorithm. At present it can only compute
the frequent items in a given column and their frequencies. On that basis, we can later implement topN
functions similar to those supported by Kylin.
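
As an illustration only, here is a minimal Python sketch of the Space-Saving idea. It is not the code added by this commit. The function name `space_saving_topn` is hypothetical, and sizing the counter table as `top_num * space_expand_rate` is an assumption based on the description of `space_expand_rate` in this message.

```python
def space_saving_topn(stream, top_num, space_expand_rate=50):
    """Approximate the top_num most frequent items with Space-Saving.

    Assumption: the counter budget is top_num * space_expand_rate.
    """
    capacity = top_num * space_expand_rate
    counters = {}  # item -> estimated count (never below the true count)
    for item in stream:
        if item in counters:
            counters[item] += 1
        elif len(counters) < capacity:
            counters[item] = 1
        else:
            # Table is full: evict the item with the smallest count and let
            # the newcomer inherit that count plus one.
            victim = min(counters, key=counters.get)
            counters[item] = counters.pop(victim) + 1
    # Highest estimated count first; ties broken by the item's string form.
    ranked = sorted(counters.items(), key=lambda kv: (-kv[1], str(kv[0])))
    return ranked[:top_num]


stream = ["a"] * 5 + ["b"] * 3 + ["c"] * 2 + ["d"]
print(space_saving_topn(stream, top_num=2, space_expand_rate=2))
```

The linear scan for the minimum keeps the sketch short. A production implementation would normally track the minimum counter with a more efficient structure. Because an evicted item's count is inherited by the newcomer, estimates can exceed the true frequency, which is why a larger counter budget tends to improve accuracy.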

I have also added a test that measures the accuracy of this algorithm. Rough results follow.
The data set has 1 million rows and follows a Zipfian distribution. "Element cardinality" is the
cardinality of the data, and the column headers 20X, 50X, ... give the space_expand_rate value
(20, 50, ...), which sets the number of counters used by the Space-Saving algorithm.

```
zf exponent = 0.5
Element cardinality     20X     50X     100X
1000                    100%    100%    100%
10000                   100%    100%    100%
100000                  100%    100%    100%
500000                   94%     98%     99%

zf exponent = 0.6, 1
Element cardinality     20X     50X     100X
1000                    100%    100%    100%
10000                   100%    100%    100%
100000                  100%    100%    100%
500000                  100%    100%    100%

```
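
A measurement of this kind could be set up roughly as follows. This is a sketch under stated assumptions, not the test added by this commit. The helper names `zipf_sample` and `topn_accuracy` are hypothetical, and defining accuracy as the fraction of the exact top-N items that the approximate result also reports is an assumption, since the message does not define the metric.

```python
import random
from collections import Counter


def zipf_sample(cardinality, exponent, size, seed=0):
    """Draw `size` ranks from 1..cardinality with P(rank) proportional to rank**-exponent."""
    rng = random.Random(seed)
    ranks = range(1, cardinality + 1)
    weights = [rank ** -exponent for rank in ranks]
    return rng.choices(ranks, weights=weights, k=size)


def topn_accuracy(data, approx_items, top_num):
    """Fraction of the exact top_num items that also appear in approx_items.

    Assumption: this overlap ratio is the accuracy metric.
    """
    exact = {item for item, _ in Counter(data).most_common(top_num)}
    return len(exact & set(approx_items)) / top_num


data = zipf_sample(cardinality=1000, exponent=0.5, size=100_000)
exact_top = [item for item, _ in Counter(data).most_common(10)]
# Comparing the exact answer with itself gives 1.0; pass the items returned
# by an approximate top-N implementation here to score it.
print(topn_accuracy(data, exact_top, top_num=10))
```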

2020-12-16 21:58:34 +08:00

fe-common

[CodeRefactor] Modify FE modules (#4146)

2020-07-29 16:18:05 +08:00

fe-core

[Feature] Add Topn udaf (#4803)

2020-12-16 21:58:34 +08:00

spark-dpp

[SparkLoadk] Avoid to read whole hive table when we add a where (#5047)

2020-12-15 09:26:42 +08:00

checkstyle-apache-header.txt

Add Checkstyle for doris-fe (#1353)

2019-06-21 21:45:54 +08:00

checkstyle.xml

Add classes related to "tag". (#2343)

2019-12-15 20:13:29 +08:00

pom.xml

[Compile] Update Repository for java-cup and cup-maven-plugin (#4769)

2020-10-22 21:38:19 +08:00

README

[CodeRefactor] Modify FE modules (#4146)

2020-07-29 16:18:05 +08:00

README

# fe-common

This module holds common classes shared by the other modules.

# spark-dpp

This module is the Spark DPP program, used by the Spark Load feature.
Depends: fe-common

# fe-core

This module contains the main FE process.
Depends: fe-common, spark-dpp