Sonictherocketman / gzip-classifier
A gzip-based text-classification system.
☆34Updated last year
Alternatives and similar repositories for gzip-classifier
Users who are interested in gzip-classifier are comparing it to the libraries listed below.
- Pre-train Static Word Embeddings☆84Updated last month
- Generalist and Lightweight Model for Text Classification☆139Updated last month
- ☆21Updated 4 years ago
- ☆75Updated last month
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆31Updated 10 months ago
- Using short models to classify long texts☆21Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 2 months ago
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters"☆46Updated 2 years ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆131Updated 6 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- Efficient few-shot learning with cross-encoders.☆54Updated last year
- ☆16Updated last year
- NeatText: a simple NLP package for cleaning textual data and text preprocessing☆72Updated last year
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆62Updated 2 months ago
- Chunk your text more accurately using gpt4o-mini☆44Updated 11 months ago
- Ranking of fine-tuned HF models as base models.☆35Updated 2 months ago
- Simply, faster, sentence-transformers☆143Updated 10 months ago
- RaKUn 2.0 - A fast keyword detection algorithm☆67Updated 3 months ago
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper☆16Updated 7 months ago
- ☆31Updated 2 years ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆83Updated 6 months ago
- ☆28Updated 2 years ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆66Updated 5 months ago
- ☆43Updated 2 years ago
- 🤝 Trade any tensors over the network☆30Updated last year
- ☆31Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆66Updated 2 years ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆32Updated 3 months ago
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.☆31Updated last year
- A Streamlit component for annotating text by selecting it.☆40Updated last year