A pipeline parallel training script for LLMs.
☆169 · Apr 30, 2025 · Updated last year
Alternatives and similar repositories for qlora-pipe
Users that are interested in qlora-pipe are comparing it to the libraries listed below.
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a… ☆36 · Jan 18, 2026 · Updated 4 months ago
- A holistic way of understanding how Llama and its components run in practice, with code and detailed documentation. ☆317 · Aug 20, 2024 · Updated last year
- Set of Utilities I Have Coded to Help Me Train RPGv6 on Flux1 ☆84 · Feb 13, 2026 · Updated 3 months ago
- A simple SDXL fine-tuning toolkit based on the DreamBooth branch of AutoTrain Advanced from 🤗, inspired by the way ai-toolkit approaches… ☆18 · Sep 30, 2024 · Updated last year
- ☆13 · Jun 18, 2024 · Updated last year
- Simple LLM inference server ☆20 · Jun 13, 2024 · Updated last year
- Advanced CLI diffusion inference/training suite based on Musubi Tuner ☆40 · Apr 15, 2026 · Updated last month
- 5X faster 60% less memory QLoRA finetuning ☆21 · May 28, 2024 · Updated 2 years ago
- Efficient visual programming for AI language models ☆361 · May 13, 2025 · Updated last year
- Training LLMs with QLoRA + FSDP ☆1,545 · Nov 9, 2024 · Updated last year
- Simple script to quiz LLMs ☆29 · Jan 28, 2024 · Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting ☆16 · Jan 17, 2024 · Updated 2 years ago
- ☆16 · Apr 23, 2024 · Updated 2 years ago
- A script for merging a LLM model and a LoRA ☆13 · Jun 22, 2023 · Updated 2 years ago
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models ☆78 · Dec 17, 2024 · Updated last year
- HunyuanVideo: A Systematic Framework For Large Video Generation Model ☆48 · Dec 14, 2024 · Updated last year
- Supercharge huggingface transformers with model parallelism. ☆78 · Jul 23, 2025 · Updated 10 months ago
- The best OSS video generation models ☆135 · Oct 24, 2024 · Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode ☆148 · Aug 7, 2025 · Updated 9 months ago
- ☆12 · May 30, 2025 · Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models ☆181 · May 2, 2024 · Updated 2 years ago
- Large-Language-Model to Machine Interface project. ☆19 · Dec 5, 2023 · Updated 2 years ago
- Simple high-throughput inference library ☆156 · May 14, 2025 · Updated last year
- Utilities for efficient fine-tuning, inference and evaluation of code generation models ☆21 · Oct 3, 2023 · Updated 2 years ago
- ☆124 · Dec 18, 2024 · Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆133 · Apr 30, 2026 · Updated 3 weeks ago
- ☆92 · Jul 11, 2025 · Updated 10 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a… ☆12 · Jun 25, 2024 · Updated last year
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT model ☆13 · Sep 25, 2024 · Updated last year
- A bagel, with everything. ☆326 · Apr 11, 2024 · Updated 2 years ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following" ☆32 · Jun 5, 2025 · Updated 11 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆267 · Apr 23, 2024 · Updated 2 years ago
- ☆20 · Nov 28, 2025 · Updated 6 months ago
- ☆324 · Sep 18, 2024 · Updated last year
- Diffusers Image Outpaint for ComfyUI ☆92 · May 20, 2026 · Updated last week
- A set of nodes to edit videos using the Hunyuan Video model ☆48 · Feb 28, 2025 · Updated last year
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM ☆13 · May 30, 2025 · Updated last year
- A set of nodes to edit videos using the Hunyuan Video model ☆493 · Feb 21, 2025 · Updated last year
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware. ☆759 · Sep 27, 2024 · Updated last year