foundation-model-stack / fms-hf-tuningLinks
π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
β47Updated this week
Alternatives and similar repositories for fms-hf-tuning
Users that are interested in fms-hf-tuning are comparing it to the libraries listed below
Sorting:
- Python library for Synthetic Data Generationβ42Updated this week
- LM engine is a library for pretraining/finetuning LLMsβ57Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ60Updated last month
- Estimate resources needed to train LLMsβ13Updated 4 months ago
- π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data β¦β199Updated this week
- π Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.β11Updated last week
- codebase release for EMNLP2023 paper publicationβ19Updated last month
- Train, tune, and infer Bamba modelβ127Updated 3 weeks ago
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Dataβ42Updated this week
- vLLM adapter for a TGIS-compatible gRPC server.β32Updated this week
- β39Updated 11 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β33Updated last month
- a pipeline for using api calls to agnostically convert unstructured data into structured training dataβ30Updated 9 months ago
- Pre-training code for CrystalCoder 7B LLMβ54Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.β36Updated last year
- β23Updated last year
- Unofficial implementation of https://arxiv.org/pdf/2407.14679β45Updated 9 months ago
- β51Updated 7 months ago
- The repository contains generative AI analytics platform application code.β26Updated last month
- Codebase accompanying the Summary of a Haystack paper.β78Updated 9 months ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models.β20Updated last week
- Data preparation code for Amber 7B LLMβ91Updated last year
- β74Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignmentβ57Updated 9 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β78Updated 2 weeks ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.β58Updated last month
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.β23Updated 2 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeedβ35Updated last year
- Benchmark suite for LLMs from Fireworks.aiβ76Updated 2 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 11 months ago