A pipeline parallel training script for LLMs.
☆169Apr 30, 2025Updated last year
Alternatives and similar repositories for qlora-pipe
Users that are interested in qlora-pipe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆36Jan 18, 2026Updated 5 months ago
- Set of Utilities I Have Coded to Help Me Train RPGv6 on Flux1☆84Feb 13, 2026Updated 4 months ago
- Fast, config-driven SDXL LoRA & DreamBooth fine-tuning — train from a single YAML file. QLoRA + torch.compile.☆19Updated this week
- ☆13Jun 18, 2024Updated 2 years ago
- Simple LLM inference server☆20Jun 13, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Advanced CLI diffusion inference/training suite based on Musubi Tuner☆40Apr 15, 2026Updated 2 months ago
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated 2 years ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- Efficient visual programming for AI language models☆359May 13, 2025Updated last year
- Training LLMs with QLoRA + FSDP☆1,549Nov 9, 2024Updated last year
- ☆16Apr 23, 2024Updated 2 years ago
- A script for merging a LLM model and a LoRA☆13Jun 22, 2023Updated 2 years ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆48Dec 14, 2024Updated last year
- Supercharge huggingface transformers with model parallelism.☆78Jul 23, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The best OSS video generation models☆135Oct 24, 2024Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode☆148Aug 7, 2025Updated 10 months ago
- ☆12May 30, 2025Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆181May 2, 2024Updated 2 years ago
- Large-Language-Model to Machine Interface project.☆19Dec 5, 2023Updated 2 years ago
- Simple high-throughput inference library☆158Jun 10, 2026Updated last week
- Utilities for efficient fine-tuning, inference and evaluation of code generation models☆21Oct 3, 2023Updated 2 years ago
- ☆125Dec 18, 2024Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆137Apr 30, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆12Jun 25, 2024Updated last year
- Development repository for the Triton language and compiler☆34Oct 24, 2024Updated last year
- A bagel, with everything.☆326Apr 11, 2024Updated 2 years ago
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT model☆13Sep 25, 2024Updated last year
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆32Jun 5, 2025Updated last year
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆267Apr 23, 2024Updated 2 years ago
- ☆21Nov 28, 2025Updated 6 months ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Oct 9, 2024Updated last year
- ☆323Sep 18, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A set of nodes to edit videos using the Hunyuan Video model☆48Feb 28, 2025Updated last year
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆13May 30, 2025Updated last year
- QuIP quantization☆66Mar 17, 2024Updated 2 years ago
- L3 R3: AGM RISC-V +CPLD/FPGA MCU (AG32VH407/AG32VF407/AG32VF303)☆13Nov 3, 2024Updated last year
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆760Sep 27, 2024Updated last year
- Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS)☆82Sep 30, 2025Updated 8 months ago
- A tool to help adjust or zero-out Flux Block Weights and SAVE. I'm not a dev, so this implementation might be wrong.☆29Nov 20, 2024Updated last year