A pipeline parallel training script for LLMs.
☆167 · Apr 30, 2025 · Updated 10 months ago
Alternatives and similar repositories for qlora-pipe
Users who are interested in qlora-pipe are comparing it to the libraries listed below.
- A pipeline parallel training script for diffusion models. ☆1,869 · Feb 8, 2026 · Updated last month
- A holistic way of understanding how Llama and its components run in practice, with code and detailed documentation. ☆317 · Aug 20, 2024 · Updated last year
- Simple LLM inference server ☆20 · Jun 13, 2024 · Updated last year
- HunyuanVideo: A Systematic Framework For Large Video Generation Model ☆48 · Dec 14, 2024 · Updated last year
- A simple SDXL fine-tuning toolkit based on the DreamBooth branch of AutoTrain Advanced from 🤗, inspired by the way ai-toolkit approaches… ☆18 · Sep 30, 2024 · Updated last year
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a… ☆35 · Jan 18, 2026 · Updated last month
- Utilities for efficient fine-tuning, inference and evaluation of code generation models ☆21 · Oct 3, 2023 · Updated 2 years ago
- Supercharge huggingface transformers with model parallelism. ☆78 · Jul 23, 2025 · Updated 7 months ago
- A script for merging a LLM model and a LoRA ☆13 · Jun 22, 2023 · Updated 2 years ago
- Development repository for the Triton language and compiler ☆34 · Oct 24, 2024 · Updated last year
- Set of Utilities I Have Coded to Help Me Train RPGv6 on Flux1 ☆84 · Feb 13, 2026 · Updated 3 weeks ago
- Advanced CLI diffusion inference/training suite based on Musubi Tuner ☆40 · Updated this week
- Efficient visual programming for AI language models ☆361 · May 13, 2025 · Updated 9 months ago
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT model ☆13 · Sep 25, 2024 · Updated last year
- Program that enables seamless interaction with your documents through an advanced vector database and the power of Large Language Model (… ☆18 · Sep 12, 2023 · Updated 2 years ago
- Training LLMs with QLoRA + FSDP ☆1,538 · Nov 9, 2024 · Updated last year
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have" ☆48 · Oct 9, 2024 · Updated last year
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models ☆78 · Dec 17, 2024 · Updated last year
- ☆16 · Apr 23, 2024 · Updated last year
- Large-Language-Model to Machine Interface project. ☆19 · Dec 5, 2023 · Updated 2 years ago
- ☆16 · Apr 7, 2024 · Updated last year
- Using multiple LLMs for ensemble Forecasting ☆16 · Jan 17, 2024 · Updated 2 years ago
- TLS & API keys for your LLM APIs ☆20 · Dec 17, 2025 · Updated 2 months ago
- ☆20 · Jun 26, 2024 · Updated last year
- ☆38 · Jun 16, 2024 · Updated last year
- Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation; CoLLAs 2025) ☆35 · Jul 21, 2025 · Updated 7 months ago
- A set of nodes to edit videos using the Hunyuan Video model ☆49 · Feb 28, 2025 · Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models ☆180 · May 2, 2024 · Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆267 · Dec 4, 2025 · Updated 3 months ago
- Speech AI training and inference tools ☆36 · Jun 25, 2023 · Updated 2 years ago
- QuIP quantization ☆62 · Mar 17, 2024 · Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode ☆145 · Aug 7, 2025 · Updated 7 months ago
- ☆19 · Jul 11, 2024 · Updated last year
- automatically quant GGUF models ☆220 · Dec 23, 2025 · Updated 2 months ago
- Local first human friendly agents toolkit for the browser and Nodejs ☆45 · Feb 28, 2026 · Updated last week
- ☆1,724 · Updated this week
- 5X faster 60% less memory QLoRA finetuning ☆21 · May 28, 2024 · Updated last year
- Formatron empowers everyone to control the format of language models' output with minimal overhead. ☆234 · Jun 7, 2025 · Updated 9 months ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware. ☆754 · Sep 27, 2024 · Updated last year