timothelaborie / text_classification_scriptsLinks
Scripts for text classification with llama and bert
β29Updated 5 months ago
Alternatives and similar repositories for text_classification_scripts
Users that are interested in text_classification_scripts are comparing it to the libraries listed below
Sorting:
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β77Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Updated 3 months ago
- Train LLM on Hugging Face infraβ67Updated 2 months ago
- minimal scripts for 24GB VRAM GPUs. training, inference, whateverβ50Updated 2 weeks ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Trainingβ74Updated 2 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β51Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β69Updated last month
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.β77Updated 6 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β89Updated last month
- Learn the building blocks of how to build gpt-oss from scratchβ110Updated 3 months ago
- Codebase accompanying the Summary of a Haystack paper.β80Updated last year
- β51Updated 3 months ago
- β53Updated 11 months ago
- β121Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ72Updated last year
- Chunk your text using gpt4o-mini more accuratelyβ44Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.β73Updated last year
- unsloth-5090-multipleβ60Updated 7 months ago
- Repository containing awesome resources regarding Hugging Face tooling.β48Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challengeβ59Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasksβ137Updated last year
- Let's build better datasets, together!β267Updated last year
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddingsβ39Updated 4 months ago
- Enhancing Translation with RAG-Powered Large Language Modelsβ88Updated 2 weeks ago
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflowβ64Updated 2 years ago
- A collection of hand on notebook for LLMs practitionerβ51Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorchβ103Updated last year
- a curated list of the role of small models in the LLM eraβ111Updated last year
- Universal text classifier for generative modelsβ24Updated last year
- β23Updated 2 years ago