THUDM / MoELoRA_RiemannianLinks
Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)
☆35Updated 10 months ago
Alternatives and similar repositories for MoELoRA_Riemannian
Users that are interested in MoELoRA_Riemannian are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆53Updated 4 months ago
- ☆54Updated 10 months ago
- ☆23Updated last year
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆21Updated 11 months ago
- ☆51Updated 8 months ago
- ☆59Updated 6 months ago
- Reinforced Multi-LLM Agents training☆69Updated 2 weeks ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆60Updated 7 months ago
- [ICLR 2026] Adaptive Thinking via Mode Policy Optimization for Social Language Agents☆47Updated 7 months ago
- ☆24Updated 8 months ago
- ☆48Updated 5 months ago
- ☆141Updated 10 months ago
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"☆30Updated 3 weeks ago
- AdaMoLE: Adaptive Mixture of LoRA Experts☆38Updated last year
- codes for Efficient Test-Time Scaling via Self-Calibration☆19Updated 4 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆57Updated 8 months ago
- ☆110Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆27Updated last week
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆61Updated 2 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆45Updated 7 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆89Updated 11 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆131Updated 9 months ago
- A Self-Training Framework for Vision-Language Reasoning☆88Updated last year
- [ACL 2025] Knowledge Unlearning for Large Language Models☆48Updated 4 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆99Updated last year
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Updated last month
- [NeurIPS'25 Spotlight🔥] Official Implementation of RobustMerge: Parameter-Efficient Model Merging for MLLMs with Direction Robustness☆56Updated last month
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated 5 months ago
- PyTorch implementation of StableMask (ICML'24)☆15Updated last year
- ☆45Updated last month