From b4fb2758176d761831071b7578976b2e1fc11752 Mon Sep 17 00:00:00 2001 From: powturbo Date: Fri, 19 Jun 2015 10:55:53 +0200 Subject: [PATCH] =?UTF-8?q?:new:=20Java+64=20bits=20lists=20for=20BitPacki?= =?UTF-8?q?ng,=20VSimple,=20VByte,=20Elias=20Fano,=E2=80=A6?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- README.md | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 24af0b9..36a77a0 100644 --- a/README.md +++ b/README.md @@ -71,7 +71,7 @@ MI/s: 1.000.000 integers/second. 1000 MI/s = 4 GB/s
TurboPForDA,TurboForDA: Direct Access is normally used when accessing individual values. ##### - Data files: - - gov2.sorted from [DocId data set](http://lemire.me/data/integercompression2014.html) Block size=128 (lz4+VSimple 64k) + - gov2.sorted from [DocId data set](#DocId data set) Block size=128 (lz4+VSimple 64k) ./icbench -c1 gov2.sorted @@ -98,7 +98,7 @@ TurboPForDA,TurboForDA: Direct Access is normally used when accessing individual | 8.594.342.216| 35.93|11.50|1307.22|1593.07|libfor| |23.918.861.764|100.00|32.00|1456.17|1480.78|Copy| -lz4 w/ delta+transpose similar to delta+[blosc](https://github.com/Blosc/c-blosc) +lz4 w/ delta+transpose similar to delta + [blosc](#blosc) ##### - Compressed Inverted Index Intersections with GOV2
GOV2: 426GB, 25 Millions documents, average doc. size=18k. @@ -261,6 +261,7 @@ header files to use with documentation:
+ [Index Compression Using 64-Bit Words](http://people.eng.unimelb.edu.au/ammoffat/abstracts/am10spe.html): Simple-8b (speed optimized version tested) + [libfor](https://github.com/cruppstahl/for) + [Document identifier data set](http://lemire.me/data/integercompression2014.html) + + [blosc](https://github.com/Blosc/c-blosc). blosc is like transpose/shuffle+lz77 + **Publications:** - [SIMD Compression and the Intersection of Sorted Integers](http://arxiv.org/abs/1401.6399) - [Partitioned Elias-Fano Indexes](http://www.di.unipi.it/~ottavian/files/elias_fano_sigir14.pdf)