angrysky56 / llada_guiLinks
GUI for LLaDA Diffusion LLM with Quantization for low end GPU and CPU options.
☆25Updated 11 months ago
Alternatives and similar repositories for llada_gui
Users that are interested in llada_gui are comparing it to the libraries listed below
Sorting:
- Unofficial Implementation of Evolutionary Model Merging☆41Updated last year
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆56Updated 11 months ago
- ☆84Updated 3 months ago
- Esoteric Language Models☆111Updated this week
- This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.☆106Updated last year
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆134Updated 10 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆108Updated 8 months ago
- Easy and Efficient dLLM Fine-Tuning☆209Updated 3 weeks ago
- Modeling code for a BitNet b1.58 Llama-style model.☆25Updated last year
- Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"☆124Updated last year
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆362Updated 8 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)☆147Updated last year
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆56Updated last year
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics☆71Updated 3 weeks ago
- AHN: Artificial Hippocampus Networks for Efficient Long-Context Modeling☆166Updated 3 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 9 months ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆159Updated 2 years ago
- A pipeline parallel training script for LLMs.☆166Updated 9 months ago
- PyTorch implementation of Titans.☆31Updated last year
- ☆19Updated last year
- The official github repo for "Diffusion Language Models are Super Data Learners".☆221Updated 3 months ago
- ☆88Updated 8 months ago
- ☆126Updated 11 months ago
- [NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models☆236Updated 3 months ago
- ☆71Updated last year
- LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.☆319Updated this week
- Implementation of DoRA☆306Updated last year
- ☆82Updated last year
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Updated 11 months ago
- Official Repository of Native Parallel Reasoner☆100Updated last week