cyrilou242 / ftcc
Fast Text Classification with Compressors dictionary
☆147Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ftcc
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.☆351Updated 2 months ago
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D☆95Updated 9 months ago
- Embedding Vector Oriented Clustering☆117Updated last week
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆87Updated 10 months ago
- ☆248Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆461Updated this week
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead☆113Updated this week
- Library for fast text representation and classification.☆28Updated 10 months ago
- NLP with Rust for Python 🦀🐍☆59Updated 5 months ago
- Training code for Sparse Autoencoders on Embedding models☆33Updated 3 weeks ago
- ☆74Updated 4 months ago
- ☆49Updated last week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA