A pipeline parallel training script for LLMs.
β168Apr 30, 2025Updated last year
Alternatives and similar repositories for qlora-pipe
Users that are interested in qlora-pipe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A pipeline parallel training script for diffusion models.β1,939Apr 25, 2026Updated 2 weeks ago
- π³ MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test aβ¦β36Jan 18, 2026Updated 3 months ago
- A simple SDXL fine-tuning toolkit based on the DreamBooth branch of AutoTrain Advanced from π€, inspired by the way ai-toolkit approachesβ¦β17Sep 30, 2024Updated last year
- A holistic way of understanding how Llama and its components run in practice, with code and detailed documentation.β316Aug 20, 2024Updated last year
- Set of Utilities I Have Coded to Help Me Train RPGv6 on Flux1β84Feb 13, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- β13Jun 18, 2024Updated last year
- Advanced CLI diffusion inference/training suite based on Musubi Tunerβ40Apr 15, 2026Updated 3 weeks ago
- 5X faster 60% less memory QLoRA finetuningβ21May 28, 2024Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.β14Mar 30, 2024Updated 2 years ago
- Simple script to quiz LLMsβ29Jan 28, 2024Updated 2 years ago
- β16Apr 23, 2024Updated 2 years ago
- A script for merging a LLM model and a LoRAβ13Jun 22, 2023Updated 2 years ago
- cli tool to quantize gguf, gptq, awq, hqq and exl2 modelsβ79Dec 17, 2024Updated last year
- β1,822Updated this week
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- HunyuanVideo: A Systematic Framework For Large Video Generation Modelβ48Dec 14, 2024Updated last year
- Supercharge huggingface transformers with model parallelism.β78Jul 23, 2025Updated 9 months ago
- The best OSS video generation modelsβ135Oct 24, 2024Updated last year
- Text WebUI extension to add clever Notebooks to Chat modeβ146Aug 7, 2025Updated 9 months ago
- β12May 30, 2025Updated 11 months ago
- Large-Language-Model to Machine Interface project.β19Dec 5, 2023Updated 2 years ago
- Simple high-throughput inference libraryβ156May 14, 2025Updated 11 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language modelsβ131Apr 30, 2026Updated last week
- Utilities for efficient fine-tuning, inference and evaluation of code generation modelsβ21Oct 3, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β122Dec 18, 2024Updated last year
- β92Jul 11, 2025Updated 9 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage aβ¦β12Jun 25, 2024Updated last year
- Development repository for the Triton language and compilerβ34Oct 24, 2024Updated last year
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT modelβ13Sep 25, 2024Updated last year
- A bagel, with everything.β326Apr 11, 2024Updated 2 years ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Modelsβ268Apr 23, 2024Updated 2 years ago
- β19Nov 28, 2025Updated 5 months ago
- Speech AI training and inference toolsβ36Jun 25, 2023Updated 2 years ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Diffusers Image Outpaint for ComfyUIβ91Jul 7, 2025Updated 10 months ago
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLMβ13May 30, 2025Updated 11 months ago
- L3 R3: AGM RISC-V +CPLD/FPGA MCU (AG32VH407/AG32VF407/AG32VF303)β13Nov 3, 2024Updated last year
- Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS)β79Sep 30, 2025Updated 7 months ago
- A tool to help adjust or zero-out Flux Block Weights and SAVE. I'm not a dev, so this implementation might be wrong.β29Nov 20, 2024Updated last year
- An open source real-time AI inference engine for seamless scalingβ23Jul 2, 2025Updated 10 months ago
- A simple experiment on letting two local LLM have a conversation about anything!β112Jul 3, 2024Updated last year