lemire / RealisticTabularDataSetsLinks
Some realistic tabular datasets for testing (CSV)
☆21Updated 7 years ago
Alternatives and similar repositories for RealisticTabularDataSets
Users that are interested in RealisticTabularDataSets are comparing it to the libraries listed below
Sorting:
- Suite of universal indexes for Highly Repetitive Document Collections☆24Updated 5 years ago
- Fast implementations of the scancount algorithm: C++ header-only library☆27Updated 6 years ago
- C++11 library for fast fuzzy searching☆14Updated 10 years ago
- Implementation of the data structures described in the paper "Fast Compressed Tries using Path Decomposition".☆58Updated 3 years ago
- A fast implementation for varbyte 32bit/64bit integer compression☆121Updated 8 years ago
- Successor to Annoy https://github.com/spotify/annoy☆13Updated 10 years ago
- A SIMD-based C++ library providing rank/select queries over mutable bitmaps.☆36Updated 3 years ago
- Deterministic Acyclic Finite State Automaton implementation for morphological analysis☆18Updated 5 years ago
- Long-term book project☆34Updated 4 years ago
- A flexible and efficient C++ implementation of the Binary Interpolative Coding algorithm.☆31Updated 3 years ago
- Finite state dictionaries in Java☆132Updated 4 years ago
- Succinct data structures in C/C++☆93Updated last year
- nkvdb - is a numeric time-series database.☆35Updated 8 years ago
- A collection of succinct data structures☆211Updated 2 years ago
- High-performance dictionary coding☆109Updated 8 years ago
- Parameterless and Universal FInding of Nearest Neighbors☆59Updated 10 months ago
- Highly optimized implementation of tiered vectors, a data structure for maintaining a sequence of n elements supporting access in time O(…☆50Updated last year
- The dream accurate approximate set cardinality estimator based on 3-bit HyperLogLog. More accurate than Redis HyperLogLog.☆55Updated 4 years ago
- String Matching Algorithms Research Tool☆108Updated last year
- A framework for building reranking models.☆28Updated 10 years ago
- Implementation of the JSON semi-index described in the paper "Semi-Indexing Semi-Structured Data in Tiny Space"☆58Updated 13 years ago
- Trinity IR Infrastructure☆239Updated 6 years ago
- An efficient trie implementation.☆255Updated 5 years ago
- A monolithic index that supports worst-case optimal joins (WCOJ) by providing all collation orders in a single redundancy eliminating dat…☆16Updated 4 months ago
- finding set bits in large bitmaps☆15Updated 10 years ago
- Benchmark showing the we can randomly hash strings very quickly with good universality☆139Updated last year
- Universe-sliced indexes in C++.☆18Updated 3 years ago
- C++17 implementation of memory-efficient dynamic tries☆58Updated 3 years ago
- Proof of concept LSM-tree built on MDB☆17Updated 10 years ago
- Python bindings for the fast integer compression library FastPFor.☆61Updated 2 years ago