foundation-model-stack / fms-hf-tuning
🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
☆34Updated this week
Alternatives and similar repositories for fms-hf-tuning:
Users that are interested in fms-hf-tuning are comparing it to the libraries listed below
- IBM development fork of https://github.com/huggingface/text-generation-inference☆59Updated 2 months ago
- Dolomite Engine is a library for pretraining/finetuning LLMs☆36Updated this week
- codebase release for EMNLP2023 paper publication☆19Updated 11 months ago
- Python library for Synthetic Data Generation☆32Updated this week
- 🦄 Unitxt: a python library for getting data fired up and set for training and evaluation☆176Updated this week
- Train, tune, and infer Bamba model☆84Updated last month
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data☆30Updated this week
- ☆48Updated 3 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 9 months ago
- ☆27Updated 3 months ago
- vLLM adapter for a TGIS-compatible gRPC server.☆21Updated this week
- ☆159Updated this week
- Benchmark suite for LLMs from Fireworks.ai☆66Updated last week
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆34Updated this week
- Data preparation code for Amber 7B LLM☆85Updated 9 months ago
- Estimate resources needed to train LLMs☆12Updated last week
- Using open source LLMs to build synthetic datasets for direct preference optimization☆57Updated 11 months ago
- Self-host LLMs with vLLM and BentoML☆86Updated this week
- ☆74Updated last year
- ☆24Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆54Updated 5 months ago
- ☆53Updated 8 months ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated 9 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆46Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆10Updated 4 months ago
- ☆106Updated 3 weeks ago
- ☆31Updated 8 months ago
- Code for ExploreTom☆74Updated 2 months ago
- Supercharge huggingface transformers with model parallelism.☆76Updated 4 months ago