OpenSparseLLMs / LLaMA-MoE-v2Links
π LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
β86Updated 7 months ago
Alternatives and similar repositories for LLaMA-MoE-v2
Users that are interested in LLaMA-MoE-v2 are comparing it to the libraries listed below
Sorting:
- β90Updated 2 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuningβ76Updated 5 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Modelsβ116Updated last week
- TokenSkip: Controllable Chain-of-Thought Compression in LLMsβ164Updated 2 weeks ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shapingβ49Updated last month
- [EMNLP 2024 Findingsπ₯] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inβ¦β97Updated 8 months ago
- Model merging is a highly efficient approach for long-to-short reasoning.β71Updated last month
- β108Updated last year
- β132Updated last month
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoningβ61Updated 6 months ago
- β109Updated last month
- β143Updated 11 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibratiβ¦β40Updated last year
- β113Updated 4 months ago
- Codes for Merging Large Language Modelsβ32Updated 11 months ago
- β46Updated 3 months ago
- β36Updated 2 months ago
- [ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"β86Updated 3 weeks ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-ofβ¦β31Updated last month
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.β237Updated 2 weeks ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)β96Updated last week
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFTβ106Updated 4 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluationsβ119Updated 2 months ago
- Extrapolating RLVR to General Domains without Verifiersβ112Updated 2 weeks ago
- Open-Pandora: On-the-fly Control Video Generationβ34Updated 7 months ago
- β51Updated last week
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"β31Updated 3 weeks ago
- β318Updated last month
- β24Updated 4 months ago
- β122Updated last month