foundation-model-stack / fms-hf-tuning
🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
☆41Updated this week
Alternatives and similar repositories for fms-hf-tuning
Users that are interested in fms-hf-tuning are comparing it to the libraries listed below
Sorting:
- Estimate resources needed to train LLMs☆13Updated 2 months ago
- Dolomite Engine is a library for pretraining/finetuning LLMs☆52Updated this week
- Python library for Synthetic Data Generation☆42Updated last week
- IBM development fork of https://github.com/huggingface/text-generation-inference☆60Updated this week
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆194Updated this week
- codebase release for EMNLP2023 paper publication☆19Updated last week
- Example ML projects that use the Determined library.☆32Updated 8 months ago
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data☆40Updated this week
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆9Updated last week
- A collection of all available inference solutions for the LLMs☆87Updated 2 months ago
- Prune transformer layers☆69Updated 11 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated last month
- Inference server benchmarking tool☆57Updated 2 weeks ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated last year
- Benchmark suite for LLMs from Fireworks.ai☆72Updated this week
- ☆72Updated 3 weeks ago
- Repository containing awesome resources regarding Hugging Face tooling.☆47Updated last year
- ☆36Updated 10 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated this week
- ☆207Updated this week
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆93Updated this week
- ☆48Updated 6 months ago
- Google TPU optimizations for transformers models☆109Updated 3 months ago
- experiments with inference on llama☆104Updated 11 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- An introduction to LLM Sampling☆78Updated 5 months ago
- ☆42Updated last year
- ☆49Updated last month