timothelaborie / text_classification_scriptsLinks
Scripts for text classification with llama and bert
β29Updated 5 months ago
Alternatives and similar repositories for text_classification_scripts
Users that are interested in text_classification_scripts are comparing it to the libraries listed below
Sorting:
- Train LLM on Hugging Face infraβ67Updated last month
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β78Updated last year
- minimal scripts for 24GB VRAM GPUs. training, inference, whateverβ50Updated last week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β51Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β69Updated last month
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ72Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorchβ103Updated last year
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Trainingβ74Updated 2 months ago
- β53Updated 11 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β81Updated last year
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.β77Updated 5 months ago
- Easy to use, High Performant Knowledge Distillation for LLMsβ96Updated 8 months ago
- Repository containing awesome resources regarding Hugging Face tooling.β48Updated 2 years ago
- β104Updated 9 months ago
- β121Updated last year
- Learn the building blocks of how to build gpt-oss from scratchβ108Updated 3 months ago
- Synthetic Text Dataset Generation for LLM projectsβ55Updated last month
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [Fβ¦β69Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β89Updated last month
- Trully flash implementation of DeBERTa disentangled attention mechanism.β67Updated 3 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β45Updated last year
- Let's build better datasets, together!β267Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creationβ113Updated last year
- Codebase accompanying the Summary of a Haystack paper.β80Updated last year
- β50Updated 2 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs β¦β61Updated 11 months ago
- unsloth-5090-multipleβ60Updated 7 months ago
- β62Updated last year
- This is the reproduction repository for my π€ Hugging Face blog post on synthetic dataβ68Updated last year