cyrilou242 / ftccLinks
Fast Text Classification with Compressors dictionary
β150Updated 2 years ago
Alternatives and similar repositories for ftcc
Users that are interested in ftcc are comparing it to the libraries listed below
Sorting:
- β255Updated 2 years ago
- NLP with Rust for Python π¦πβ70Updated 7 months ago
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.β415Updated 10 months ago
- Embedding Vector Oriented Clusteringβ162Updated last week
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's Dβ100Updated last year
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)β98Updated last year
- Simplified implementation of UMAP like dimensionality reduction algorithmβ53Updated last year
- Efficient BM25 with DuckDB π¦β59Updated last year
- A fast multi-core implementation of HDBSCAN for low dimensional Euclidean spacesβ126Updated last month
- Tree-based indexes for neural-searchβ31Updated last year
- Pre-train BERT from scratch, with HuggingFace. Accompanies the blog post: sidsite.com/posts/bert-from-scratchβ43Updated 7 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)β104Updated 2 months ago
- convert a scikit-learn decision tree into a Keras modelβ39Updated 2 years ago
- Implements the Tsetlin Machine, Coalesced Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetliβ¦β164Updated 4 months ago
- β249Updated 6 months ago
- Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in Allβ¦β171Updated 2 years ago
- Tiny inference-only implementation of LLaMAβ92Updated last year
- Various handy scripts to quickly setup new Linux and Windows sandboxes, containers and WSL.β40Updated last week
- An interactive exploration of Transformer programming.β270Updated 2 years ago
- β157Updated 2 years ago
- π Make Thinc faster on macOS by calling into Apple's native Accelerate libraryβ103Updated 6 months ago
- β60Updated 3 years ago
- Library for fast text representation and classification.β31Updated last year
- Efficiently computing & storing token n-grams from large corporaβ26Updated last year
- Prototyping a question and answer bot over PDFsβ39Updated 2 years ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few lβ¦β287Updated 3 months ago
- Framework for Self-Organizing Python Agentsβ29Updated last year
- A BERT that you can train on a (gaming) laptop.β210Updated 2 years ago
- Because we don't want a jupyter notebook mess...β61Updated 6 months ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tablesβ21Updated 7 months ago