foundation-model-stack / fms-hf-tuning
🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
☆52 Updated last week
Alternatives and similar repositories for fms-hf-tuning
Users who are interested in fms-hf-tuning are comparing it to the libraries listed below.
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆211 Updated last week
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆62 Updated 2 months ago
- Python library for Synthetic Data Generation ☆51 Updated last week
- ☆43 Updated last year
- LM engine is a library for pretraining/finetuning LLMs ☆76 Updated last week
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research ☆259 Updated this week
- Google TPU optimizations for transformers models ☆122 Updated 9 months ago
- codebase release for EMNLP2023 paper publication ☆19 Updated 2 months ago
- vLLM adapter for a TGIS-compatible gRPC server. ☆44 Updated this week
- Train, tune, and infer Bamba model ☆136 Updated 5 months ago
- Let's build better datasets, together! ☆264 Updated 10 months ago
- Complex Function Calling Benchmark. ☆148 Updated 9 months ago
- ☆55 Updated last year
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs) ☆245 Updated this week
- A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM ☆65 Updated this week
- Open Implementations of LLM Analyses ☆107 Updated last year
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings ☆95 Updated 3 months ago
- ☆51 Updated 9 months ago
- ☆138 Updated 2 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆78 Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆150 Updated last year
- ☆46 Updated last month
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆33 Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated last year
- ☆42 Updated last year
- Code for Zero-Shot Tokenizer Transfer ☆140 Updated 10 months ago
- ☆78 Updated 2 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆242 Updated last year
- Benchmark suite for LLMs from Fireworks.ai ☆83 Updated 2 weeks ago