OpenSparseLLMs / LLaMA-MoE-v2
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
★91 · Updated last year
Alternatives and similar repositories for LLaMA-MoE-v2
Users who are interested in LLaMA-MoE-v2 are comparing it to the repositories listed below.
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ★89 · Updated 11 months ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs ★199 · Updated last month
- [EMNLP 2024 Findings 🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In… ★103 · Updated last year
- ★115 · Updated 4 months ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping ★62 · Updated 7 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ★70 · Updated 6 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression ★127 · Updated 9 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models ★153 · Updated 6 months ago
- ★177 · Updated last month
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context" ★36 · Updated last year
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains ★69 · Updated 5 months ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration ★61 · Updated 11 months ago
- Model merging is a highly efficient approach for long-to-short reasoning. ★98 · Updated 3 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models. ★87 · Updated 11 months ago
- ★47 · Updated 9 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of… ★74 · Updated 7 months ago
- ★28 · Updated last year
- ★33 · Updated 2 months ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning. ★164 · Updated 3 months ago
- ★144 · Updated 4 months ago
- Official Repository of LatentSeek ★74 · Updated 7 months ago
- dParallel: Learnable Parallel Decoding for dLLMs ★53 · Updated 3 months ago
- Open-Pandora: On-the-fly Control Video Generation ★35 · Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ★109 · Updated 7 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations ★143 · Updated 2 months ago
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ… ★38 · Updated 6 months ago
- ★62 · Updated 6 months ago
- ★174 · Updated last year
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ★104 · Updated 4 months ago
- ★127 · Updated 7 months ago