foundation-model-stack / fms-hf-tuningLinks
π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
β44Updated this week
Alternatives and similar repositories for fms-hf-tuning
Users that are interested in fms-hf-tuning are comparing it to the libraries listed below
Sorting:
- Estimate resources needed to train LLMsβ13Updated 3 months ago
- LM engine is a library for pretraining/finetuning LLMsβ55Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ60Updated 3 weeks ago
- Python library for Synthetic Data Generationβ42Updated this week
- π Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.β10Updated last month
- π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data β¦β196Updated this week
- codebase release for EMNLP2023 paper publicationβ19Updated 3 weeks ago
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Dataβ42Updated this week
- Example ML projects that use the Determined library.β32Updated 8 months ago
- Train, tune, and infer Bamba modelβ127Updated last month
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuningβ42Updated 3 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)β105Updated this week
- β44Updated last year
- β49Updated 6 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.β36Updated last year
- β41Updated 2 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data generaβ¦β60Updated this week
- β93Updated last week
- β23Updated last year
- FMS Model Optimizer is a framework for developing reduced precision neural network models.β20Updated last week
- An introduction to LLM Samplingβ78Updated 5 months ago
- vLLM adapter for a TGIS-compatible gRPC server.β30Updated this week
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.β24Updated last month
- Verifiers for LLM Reinforcement Learningβ55Updated last month
- β99Updated this week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated 3 weeks ago
- Benchmark suite for LLMs from Fireworks.aiβ75Updated 2 weeks ago
- Set of scripts to finetune LLMsβ37Updated last year
- β28Updated 4 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Ayaβ111Updated 2 weeks ago