rasbt / nn_plus_gzip

Gzip and nearest neighbors for text classification

☆57

Alternatives and similar repositories for nn_plus_gzip

Users who are interested in nn_plus_gzip are comparing it to the repositories listed below.

MantisAI / hugie
Command Line Interface for Hugging Face Inference Endpoints
☆66 Updated last year
d-kleine / NER_decoder
Named Entity Recognition with a decoder-only (autoregressive) LLM using HuggingFace
☆43 Updated 8 months ago
muellerzr / minimal-trainer-zoo
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
☆197 Updated last year
unicamp-dl / InRanker
☆48 Updated last year
jxtngx / lightning-lab
deep learning with pytorch lightning
☆1 Updated 9 months ago
davanstrien / data-for-fine-tuning-llms
☆77 Updated last year
warner-benjamin / commented-transformers
Highly commented implementations of Transformers in PyTorch
☆136 Updated 2 years ago
rwightman / genalog
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…
☆42 Updated last year
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33 Updated 2 months ago
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆64 Updated 2 months ago
khuyentran1401 / pretty-print-confusion-matrix
Confusion Matrix in Python: plot a pretty confusion matrix (like Matlab) in python using seaborn and matplotlib
☆19 Updated 3 years ago
MinishLab / tokenlearn
Pre-train Static Word Embeddings
☆85 Updated 2 months ago
januverma / transformers-stuff
Codes, scripts, and notebooks on various aspects of transformer models.
☆27 Updated 2 years ago
stas00 / ml-ways
ML/DL Math and Method notes
☆62 Updated last year
argilla-io / adept-augmentations
A Python library aimed at dissecting and augmenting NER training data.
☆58 Updated 2 years ago
pacman100 / peft-codegen-25
☆23 Updated 2 years ago
alvarobartt / vertex-ai-huggingface-inference-toolkit
🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)
☆17 Updated last year
Muhtasham / summarization-eval
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆101 Updated last year
AnswerDotAI / minai
A miniature AI training framework for PyTorch
☆41 Updated 6 months ago
krypticmouse / matryoshka-representation-learning
PyTorch implementation for MRL
☆19 Updated last year
huggingface / disaggregators
🤗 Disaggregators: Curated data labelers for in-depth analysis.
☆66 Updated 2 years ago
hamelsmu / ft-drift
Check for data drift between two OpenAI multi-turn chat jsonl files.
☆37 Updated last year
mrmps / ai-chunker
Chunk your text using gpt4o-mini more accurately
☆44 Updated last year
MoritzLaurer / zeroshot-classifier
Notebooks for training universal 0-shot classifiers on many different tasks
☆133 Updated 7 months ago
Knowledgator / GLiClass
Generalist and Lightweight Model for Text Classification
☆148 Updated last month
dm4ml / motion
Framework for building and maintaining self-updating prompts for LLMs
☆64 Updated last year
huggingface / competitions
☆124 Updated 9 months ago
NielsRogge / awesome-huggingface
Repository containing awesome resources regarding Hugging Face tooling.
☆47 Updated last year
chainyo / tensorshare
🤝 Trade any tensors over the network
☆30 Updated last year
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34 Updated last year