THUDM / MoELoRA_RiemannianLinks
Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)
☆34Updated 7 months ago
Alternatives and similar repositories for MoELoRA_Riemannian
Users that are interested in MoELoRA_Riemannian are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆47Updated last month
- ☆51Updated 3 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆85Updated 4 months ago
- ☆98Updated 10 months ago
- ☆51Updated 8 months ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆37Updated 5 months ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆60Updated last week
- [EMNLP 2024 Findings] Unlocking Continual Learning Abilities in Language Models☆25Updated last year
- ☆36Updated last month
- ☆43Updated 2 months ago
- codes for Efficient Test-Time Scaling via Self-Calibration☆18Updated last month
- RewardAnything: Generalizable Principle-Following Reward Models☆44Updated 4 months ago
- ☆21Updated 6 months ago
- ☆22Updated 5 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆54Updated 5 months ago
- ☆116Updated 2 weeks ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆19Updated 8 months ago
- AdaMoLE: Adaptive Mixture of LoRA Experts☆37Updated last year
- ☆46Updated last week
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated 2 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆85Updated 3 months ago
- A Self-Training Framework for Vision-Language Reasoning☆85Updated 9 months ago
- ☆38Updated 2 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆54Updated 7 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆109Updated 5 months ago
- ☆23Updated last year
- [NeurIPS 2025] Code for Let LLMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆50Updated last month
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"☆33Updated last year
- ☆28Updated 5 months ago
- ☆24Updated 2 months ago