foundation-model-stack / fms-hf-tuning
Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
★47 · Updated this week
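For context when comparing the projects below, here is a minimal, illustrative sketch of the kind of supervised fine-tuning run that fms-hf-tuning's recipes wrap, using HuggingFace's `SFTTrainer` from the `trl` library. The model id, data path, and config values are placeholder assumptions, not the repository's actual defaults, and fms-hf-tuning's own entry points and FSDP wiring may differ.

```python
# Minimal SFT sketch with trl's SFTTrainer (illustrative only; fms-hf-tuning's
# own recipes/CLI are not shown here). Assumes a JSONL file with a "text"
# column and a causal LM checkpoint you have access to.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Hypothetical training data path.
train_ds = load_dataset("json", data_files="train.jsonl", split="train")

trainer = SFTTrainer(
    model="facebook/opt-350m",  # placeholder model id
    train_dataset=train_ds,
    args=SFTConfig(
        output_dir="sft-output",
        per_device_train_batch_size=1,
        num_train_epochs=1,
    ),
)
trainer.train()
```

PyTorch FSDP is typically layered on at launch time (for example via an `accelerate launch` configuration) rather than inside the training script itself; packaging that wiring is part of what tuning recipes like these provide.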
Alternatives and similar repositories for fms-hf-tuning
Users interested in fms-hf-tuning are comparing it to the libraries listed below
- IBM development fork of https://github.com/huggingface/text-generation-inference · ★61 · Updated 3 months ago
- Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … · ★206 · Updated this week
- Python library for Synthetic Data Generation · ★42 · Updated this week
- ★41 · Updated last year
- Google TPU optimizations for transformers models · ★117 · Updated 6 months ago
- LM engine is a library for pretraining/finetuning LLMs · ★61 · Updated this week
- Train, tune, and infer Bamba model · ★130 · Updated 2 months ago
- Benchmark suite for LLMs from Fireworks.ai · ★76 · Updated last week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… · ★81 · Updated last week
- ★232 · Updated this week
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 training · ★68 · Updated last week
- ★53 · Updated 9 months ago
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data · ★42 · Updated this week
- Source code for the collaborative reasoner research project at Meta FAIR. · ★99 · Updated 3 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" · ★72 · Updated 8 months ago
- ★124 · Updated 9 months ago
- ★129 · Updated 4 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods. · ★23 · Updated 4 months ago
- ★31 · Updated 8 months ago
- Codebase release for an EMNLP 2023 paper · ★19 · Updated 3 months ago
- Code for the paper "ROUTERBENCH: A Benchmark for Multi-LLM Routing System" · ★131 · Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. · ★33 · Updated 2 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research · ★218 · Updated this week
- vLLM adapter for a TGIS-compatible gRPC server. · ★34 · Updated this week
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. · ★83 · Updated this week
- ★49 · Updated 5 months ago
- Code for the NeurIPS LLM Efficiency Challenge · ★59 · Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters · ★268 · Updated last year
- Utils for Unsloth · ★120 · Updated last week
- An introduction to LLM Sampling · ★79 · Updated 7 months ago