foundation-model-stack / fms-hf-tuningLinks
π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
β52Updated 2 weeks ago
Alternatives and similar repositories for fms-hf-tuning
Users that are interested in fms-hf-tuning are comparing it to the libraries listed below
Sorting:
- π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data β¦β211Updated last week
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ61Updated last month
- β43Updated last year
- Python library for Synthetic Data Generationβ51Updated last week
- Accelerating your LLM training to full speed! Made with β€οΈ by ServiceNow Researchβ254Updated last week
- Google TPU optimizations for transformers modelsβ121Updated 9 months ago
- Train, tune, and infer Bamba modelβ135Updated 4 months ago
- LM engine is a library for pretraining/finetuning LLMsβ74Updated this week
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β78Updated last year
- codebase release for EMNLP2023 paper publicationβ19Updated last month
- Manage scalable open LLM inference endpoints in Slurm clustersβ273Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creationβ110Updated last year
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing Systemβ147Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" π€β76Updated 10 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignmentβ60Updated last year
- Benchmark suite for LLMs from Fireworks.aiβ83Updated 2 weeks ago
- Code for NeurIPS LLM Efficiency Challengeβ59Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β33Updated last month
- β110Updated last month
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.β24Updated this week
- β124Updated last year
- Codebase accompanying the Summary of a Haystack paper.β79Updated last year
- Supercharge huggingface transformers with model parallelism.β77Updated 3 months ago
- vLLM adapter for a TGIS-compatible gRPC server.β41Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β50Updated last year
- β136Updated 2 months ago
- Open Implementations of LLM Analysesβ107Updated last year
- experiments with inference on llamaβ103Updated last year
- β79Updated 9 months ago
- β55Updated 11 months ago