cyrilou242 / ftccLinks
Fast Text Classification with Compressors dictionary
β150Updated last year
Alternatives and similar repositories for ftcc
Users that are interested in ftcc are comparing it to the libraries listed below
Sorting:
- NLP with Rust for Python π¦πβ64Updated 2 months ago
- β252Updated 2 years ago
- Embedding Vector Oriented Clusteringβ152Updated this week
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.β399Updated 5 months ago
- Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in Allβ¦β172Updated 2 years ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few lβ¦β286Updated last week
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)β94Updated 8 months ago
- Efficient BM25 with DuckDB π¦β54Updated 7 months ago
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's Dβ98Updated last year
- Simplified implementation of UMAP like dimensionality reduction algorithmβ50Updated 8 months ago
- A BERT that you can train on a (gaming) laptop.β209Updated last year
- β156Updated 2 years ago
- Tiny inference-only implementation of LLaMAβ93Updated last year
- β143Updated 2 years ago
- minimal pytorch implementation of bm25 (with sparse tensors)β104Updated last year
- π Make Thinc faster on macOS by calling into Apple's native Accelerate libraryβ99Updated last month
- β247Updated last month
- Implements the Tsetlin Machine, Coalesced Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetliβ¦β151Updated 3 months ago
- Library for fast text representation and classification.β31Updated last year
- β29Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDAβ39Updated last year
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ101Updated last year
- A fast multi-core implementation of HDBSCAN for low dimensional Euclidean spacesβ122Updated 2 months ago
- Training code for Sparse Autoencoders on Embedding modelsβ38Updated 5 months ago
- An interactive exploration of Transformer programming.β267Updated last year
- convert a scikit-learn decision tree into a Keras modelβ39Updated last year
- Neural Searchβ333Updated last year
- Tree-based indexes for neural-searchβ32Updated last year
- A library for incremental loading of large PyTorch checkpointsβ56Updated 2 years ago
- hnsqlite integrates hnswlib and sqlite for simple text embedding searchβ161Updated 2 years ago