cyrilou242 / ftcc
Fast Text Classification with Compressors dictionary
☆151Updated last year
Alternatives and similar repositories for ftcc:
Users that are interested in ftcc are comparing it to the libraries listed below
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D☆97Updated last year
- ☆253Updated last year
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.☆384Updated 3 weeks ago
- Full text search in your Pandas dataframe☆220Updated 3 months ago
- convert a scikit-learn decision tree into a Keras model☆39Updated last year
- NLP with Rust for Python 🦀🐍☆61Updated 9 months ago
- Embedding Vector Oriented Clustering☆133Updated 3 weeks ago
- ☆153Updated 2 years ago
- Gzip and nearest neighbors for text classification☆55Updated last year
- Simplified implementation of UMAP like dimensionality reduction algorithm☆47Updated 4 months ago
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆92Updated 3 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆280Updated last month
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA☆36Updated last year
- Visualize text embeddings☆35Updated last year
- ☆245Updated 5 months ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆204Updated 6 months ago
- A probabilistic approximate DNF counter☆36Updated 11 months ago
- Library for fast text representation and classification.