cyrilou242 / ftcc
Fast Text Classification with Compressors dictionary
☆150 Updated 2 years ago
Alternatives and similar repositories for ftcc
Users who are interested in ftcc are comparing it to the libraries listed below.
- ☆255 Updated 2 years ago
- NLP with Rust for Python 🦀🐍 ☆70 Updated 8 months ago
- Tree-based indexes for neural-search ☆31 Updated last year
- Embedding Vector Oriented Clustering ☆167 Updated last week
- Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in All… ☆171 Updated 2 years ago
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023) ☆98 Updated last year
- Simplified implementation of UMAP-like dimensionality reduction algorithm ☆53 Updated last year
- Library for fast text representation and classification. ☆31 Updated 2 years ago
- convert a scikit-learn decision tree into a Keras model ☆39 Updated 2 years ago
- ☆157 Updated 2 years ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l… ☆287 Updated last week
- Efficient BM25 with DuckDB 🦆 ☆60 Updated last year
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D ☆100 Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors) ☆104 Updated 3 months ago
- A fast multi-core implementation of HDBSCAN for low dimensional Euclidean spaces ☆128 Updated 2 months ago
- ☆251 Updated 7 months ago
- Python bindings for the fast integer compression library FastPFor. ☆61 Updated 2 years ago
- ☆144 Updated 2 years ago
- Vectorizers for a range of different data types ☆103 Updated 3 months ago
- Implements the Tsetlin Machine, Coalesced Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetli… ☆166 Updated 5 months ago
- A word2vec negative sampling implementation with correct CBOW update. ☆261 Updated 4 years ago
- Teaching Addition to Small Transformers ☆17 Updated 2 years ago
- A BERT that you can train on a (gaming) laptop. ☆209 Updated 2 years ago
- An interactive exploration of Transformer programming. ☆271 Updated 2 years ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables ☆21 Updated 8 months ago
- gzip Predicts Data-dependent Scaling Laws ☆34 Updated last year
- A declarative drawing API in Python ☆298 Updated last year
- Various handy scripts to quickly setup new Linux and Windows sandboxes, containers and WSL. ☆40 Updated last week
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data. ☆146 Updated 9 months ago
- Pre-train BERT from scratch, with HuggingFace. Accompanies the blog post: sidsite.com/posts/bert-from-scratch ☆43 Updated 8 months ago