foundation-model-stack / fms-hf-tuningLinks
🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
☆55Updated last month
Alternatives and similar repositories for fms-hf-tuning
Users that are interested in fms-hf-tuning are comparing it to the libraries listed below
Sorting:
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆212Updated last week
- IBM development fork of https://github.com/huggingface/text-generation-inference☆63Updated 4 months ago
- Python library for Synthetic Data Generation☆51Updated 2 weeks ago
- Google TPU optimizations for transformers models☆132Updated last month
- Ongoing research training transformer models at scale☆42Updated this week
- ☆42Updated last year
- LM engine is a library for pretraining/finetuning LLMs☆110Updated last week
- vLLM adapter for a TGIS-compatible gRPC server.☆47Updated this week
- Train, tune, and infer Bamba model☆138Updated 7 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆277Updated this week
- Example ML projects that use the Determined library.☆32Updated last year
- Benchmark suite for LLMs from Fireworks.ai☆85Updated this week
- codebase release for EMNLP2023 paper publication☆19Updated 4 months ago
- ☆31Updated last year
- Supercharge huggingface transformers with model parallelism.☆77Updated 5 months ago
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆57Updated 4 months ago
- experiments with inference on llama☆103Updated last year
- ☆138Updated 4 months ago
- ☆51Updated 3 months ago
- ☆55Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆269Updated this week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- ☆82Updated last month
- Manage scalable open LLM inference endpoints in Slurm clusters☆278Updated last year
- Let's build better datasets, together!☆269Updated last year
- Open Implementations of LLM Analyses☆107Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- ☆53Updated 11 months ago
- ☆48Updated last year