THUDM / MoELoRA_Riemannian
Source code for the paper "A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models" (ICML 2025).
☆35 · Updated 9 months ago
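To illustrate the general idea behind a mixture of low-rank experts, here is a minimal, dependency-free sketch: a frozen base weight `W` is augmented by several low-rank adapter pairs `(A_i, B_i)` whose outputs are mixed by a softmax gate. This is a hypothetical illustration of the generic MoE-LoRA layer structure, not the repository's actual implementation (which additionally uses Riemannian-geometry-based optimization); all names and shapes below are assumptions.

```python
# Hypothetical sketch of a mixture-of-low-rank-experts (MoE-LoRA) forward pass.
# NOT the repository's implementation; illustrative shapes and names only.
import math
import random

random.seed(0)

def matvec(M, x):
    """Multiply matrix M (list of rows) by vector x."""
    return [sum(m * v for m, v in zip(row, x)) for row in M]

def softmax(z):
    """Numerically stable softmax over a list of logits."""
    mx = max(z)
    e = [math.exp(v - mx) for v in z]
    s = sum(e)
    return [v / s for v in e]

def rand_mat(rows, cols, scale=0.1):
    return [[random.uniform(-scale, scale) for _ in range(cols)]
            for _ in range(rows)]

d_in, d_out, rank, n_experts = 4, 4, 2, 3
W = rand_mat(d_out, d_in)          # frozen pretrained weight
gate = rand_mat(n_experts, d_in)   # router: one logit per expert
# Each expert i is a low-rank pair (A_i: rank x d_in, B_i: d_out x rank).
experts = [(rand_mat(rank, d_in), rand_mat(d_out, rank))
           for _ in range(n_experts)]

def moe_lora_forward(x):
    probs = softmax(matvec(gate, x))            # gate weights over experts
    y = matvec(W, x)                            # frozen base path W @ x
    for p, (A, B) in zip(probs, experts):
        delta = matvec(B, matvec(A, x))         # low-rank update B_i @ A_i @ x
        y = [yi + p * di for yi, di in zip(y, delta)]
    return y
```

During fine-tuning only the gate and the `(A_i, B_i)` pairs would be trained, keeping the parameter count low while letting different experts specialize.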
Alternatives and similar repositories for MoELoRA_Riemannian
Users that are interested in MoELoRA_Riemannian are comparing it to the libraries listed below
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆52 · Updated 3 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models ☆56 · Updated 7 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time ☆89 · Updated 7 months ago
- GitHub repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025) ☆67 · Updated 8 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ☆69 · Updated 7 months ago
- Official repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning" ☆58 · Updated last month
- A continuously updated collection of the latest papers, technical reports, and benchmarks on multimodal reasoning ☆53 · Updated 9 months ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO ☆73 · Updated 2 months ago
- [EMNLP 2024 Findings] Unlocking Continual Learning Abilities in Language Models ☆26 · Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆109 · Updated 7 months ago
- Code and data of We-Math, accepted to the ACL 2025 main conference ☆134 · Updated last month
- JudgeLRM: Large Reasoning Models as a Judge ☆40 · Updated last month
- Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning" ☆45 · Updated 2 months ago
- Test-time preference optimization (ICML 2025) ☆177 · Updated 8 months ago
- AdaMoLE: Adaptive Mixture of LoRA Experts ☆38 · Updated last year
- [ACL'25] We propose Separate Memory and Reasoning, a novel fine-tuning method that combines prompt tuning with LoRA ☆82 · Updated 2 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025) ☆171 · Updated 2 months ago
- A benchmark for evaluating the capabilities of large vision-language models (LVLMs) ☆46 · Updated 2 years ago
- RewardAnything: Generalizable Principle-Following Reward Models ☆45 · Updated 7 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models ☆105 · Updated 9 months ago