foundation-model-stack / fms-hf-tuning
Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
☆47 · Updated last week
Alternatives and similar repositories for fms-hf-tuning
Users who are interested in fms-hf-tuning are comparing it to the libraries listed below.
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆61 · Updated 3 months ago
- Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆206 · Updated this week
- ☆42 · Updated last year
- LM engine is a library for pretraining/finetuning LLMs ☆65 · Updated this week
- Python library for Synthetic Data Generation ☆42 · Updated this week
- Benchmark suite for LLMs from Fireworks.ai ☆80 · Updated this week
- Google TPU optimizations for transformers models ☆120 · Updated 7 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆134 · Updated last year
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research ☆223 · Updated last week
- Train, tune, and infer Bamba model ☆131 · Updated 2 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆103 · Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆79 · Updated 11 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆33 · Updated 3 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆270 · Updated last year
- ☆54 · Updated 9 months ago
- ☆75 · Updated last week
- ☆31 · Updated 9 months ago
- codebase release for EMNLP2023 paper publication ☆19 · Updated 3 months ago
- ☆238 · Updated this week
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆65 · Updated last year
- Fine-tune an LLM to perform batch inference and online serving. ☆112 · Updated 3 months ago
- Let's build better datasets, together! ☆262 · Updated 8 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆88 · Updated last week
- Compare how Agent systems perform on several benchmarks. ☆100 · Updated 3 weeks ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆78 · Updated 10 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs) ☆201 · Updated last week
- Example ML projects that use the Determined library. ☆32 · Updated 11 months ago
- Code for NeurIPS LLM Efficiency Challenge ☆59 · Updated last year
- Unofficial implementation of https://arxiv.org/pdf/2407.14679 ☆48 · Updated 11 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆83 · Updated this week