foundation-model-stack / fms-hf-tuning
🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
☆56 Updated last week
Alternatives and similar repositories for fms-hf-tuning
Users who are interested in fms-hf-tuning are comparing it to the libraries listed below.
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆212 Updated 2 weeks ago
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆63 Updated 4 months ago
- ☆43 Updated last year
- Python library for Synthetic Data Generation ☆52 Updated last month
- Benchmark suite for LLMs from Fireworks.ai ☆89 Updated this week
- LM engine is a library for pretraining/finetuning LLMs ☆113 Updated this week
- Train, tune, and infer Bamba model ☆137 Updated 8 months ago
- vLLM adapter for a TGIS-compatible gRPC server. ☆50 Updated this week
- Ongoing research training transformer models at scale ☆43 Updated this week
- Google TPU optimizations for transformers models ☆134 Updated 2 weeks ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models. ☆20 Updated last month
- ☆141 Updated 5 months ago
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data ☆49 Updated this week
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research ☆287 Updated this week
- ☆125 Updated last year
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods. ☆26 Updated last month
- Codebase release for EMNLP 2023 paper publication ☆19 Updated 4 months ago
- ☆115 Updated 5 months ago
- A pipeline for using API calls to agnostically convert unstructured data into structured training data ☆32 Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters ☆280 Updated last year
- ☆56 Updated last year
- A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM ☆220 Updated last week
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆153 Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆32 Updated 4 months ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS … ☆60 Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆72 Updated last year
- ☆23 Updated 2 years ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆76 Updated last year
- ☆82 Updated 2 months ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers. ☆62 Updated 7 months ago