foundation-model-stack / fms-hf-tuning
🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
☆33Updated this week
Alternatives and similar repositories for fms-hf-tuning:
Users that are interested in fms-hf-tuning are comparing it to the libraries listed below
- IBM development fork of https://github.com/huggingface/text-generation-inference☆58Updated last month
- Python library for Synthetic Data Generation☆31Updated this week
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data☆29Updated this week
- Train, tune, and infer Bamba model☆80Updated 2 weeks ago
- codebase release for EMNLP2023 paper publication☆19Updated 11 months ago
- Dolomite Engine is a library for pretraining/finetuning LLMs☆28Updated this week
- Data preparation code for CrystalCoder 7B LLM☆44Updated 8 months ago
- ☆48Updated 2 months ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated 8 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆34Updated 9 months ago
- 🦄 Unitxt: a python library for getting data fired up and set for training and evaluation☆172Updated this week
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆29Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆53Updated 5 months ago
- ☆24Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- Set of scripts to finetune LLMs☆36Updated 10 months ago
- Benchmark suite for LLMs from Fireworks.ai☆64Updated last month
- vLLM adapter for a TGIS-compatible gRPC server.☆17Updated this week
- The official evaluation suite and dynamic data release for MixEval.☆10Updated 4 months ago
- Supercharge huggingface transformers with model parallelism.☆76Updated 3 months ago
- ☆52Updated 7 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated 10 months ago
- Code for NeurIPS LLM Efficiency Challenge☆54Updated 9 months ago
- Modeling code for a BitNet b1.58 Llama-style model.☆23Updated 9 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated last month
- TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL☆17Updated last week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆63Updated 2 months ago
- PyTorch implementation for MRL☆18Updated 11 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year