timothelaborie / text_classification_scriptsLinks
Scripts for text classification with llama and bert
β31Updated 6 months ago
Alternatives and similar repositories for text_classification_scripts
Users that are interested in text_classification_scripts are comparing it to the libraries listed below
Sorting:
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β77Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Updated 4 months ago
- minimal scripts for 24GB VRAM GPUs. training, inference, whateverβ50Updated last month
- Train LLM on Hugging Face infraβ67Updated 2 months ago
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflowβ64Updated 2 years ago
- β53Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β69Updated 2 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.β77Updated 6 months ago
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.β74Updated 2 weeks ago
- unsloth-5090-multipleβ60Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β51Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorchβ103Updated last year
- Learn the building blocks of how to build gpt-oss from scratchβ112Updated 4 months ago
- Repository containing awesome resources regarding Hugging Face tooling.β48Updated 2 years ago
- Chunk your text using gpt4o-mini more accuratelyβ44Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMsβ97Updated 8 months ago
- β125Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β84Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β90Updated 3 weeks ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).β46Updated last year
- Composition of Multimodal Language Models From Scratchβ15Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]β88Updated last year
- Curriculum training of instruction-following LLMs with Unslothβ14Updated last month
- A list of language models with permissive licenses such as MIT or Apache 2.0β24Updated 11 months ago
- A massively multilingual modern encoder language modelβ125Updated 2 weeks ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language modelsβ110Updated 8 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.β73Updated last year
- Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting w LangChain.β37Updated 2 years ago
- Pretraining and finetuning for visual instruction following with Mixture of Expertsβ16Updated 2 years ago
- minimal GRPO implementation from scratchβ102Updated 10 months ago