THUDM / MoELoRA_Riemannian
Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)
☆33 Updated 5 months ago
Alternatives and similar repositories for MoELoRA_Riemannian
Users who are interested in MoELoRA_Riemannian are comparing it to the libraries listed below.
- ☆56 Updated 3 weeks ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆44 Updated this week
- ☆49 Updated 6 months ago
- ☆82 Updated last year
- A Self-Training Framework for Vision-Language Reasoning ☆84 Updated 8 months ago
- ☆19 Updated 4 months ago
- ☆36 Updated last month
- ☆35 Updated last week
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models ☆53 Updated 3 months ago
- ☆90 Updated 8 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward ☆79 Updated last month
- SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models ☆133 Updated 5 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. ☆77 Updated last week
- ☆29 Updated last month
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆108 Updated 4 months ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202… ☆36 Updated 4 months ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning" ☆85 Updated last month
- ☆101 Updated this week
- ☆34 Updated last month
- Official Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning" ☆55 Updated last month
- ☆125 Updated 6 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning! ☆51 Updated 6 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large … ☆95 Updated 9 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆71 Updated 3 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time ☆82 Updated 3 months ago
- More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models ☆56 Updated 3 months ago
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability" ☆33 Updated last year
- ☆154 Updated 3 months ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO ☆48 Updated last week
- ☆48 Updated 4 months ago