THUDM / MoELoRA_Riemannian
Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)
☆27Updated 3 months ago
Alternatives and similar repositories for MoELoRA_Riemannian
Users who are interested in MoELoRA_Riemannian are comparing it to the repositories listed below
- Code for Efficient Test-Time Scaling via Self-Calibration☆14Updated 4 months ago
- ☆46Updated 2 months ago
- ☆136Updated last month
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆22Updated 5 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated last month
- [ACL 2025] Knowledge Unlearning for Large Language Models☆39Updated 2 months ago
- Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models?☆24Updated 4 months ago
- ☆39Updated 5 months ago
- ☆47Updated 4 months ago
- ☆22Updated last year
- ☆21Updated 8 months ago
- Reinforced Multi-LLM Agents training☆30Updated last month
- ☆47Updated 5 months ago
- Unsupervised GRPO☆38Updated last month
- ☆83Updated 6 months ago
- A comprehensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆50Updated last month
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆64Updated last month
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆49Updated last month
- ☆15Updated 2 months ago
- ☆21Updated 2 months ago
- ☆21Updated 2 months ago
- RewardAnything: Generalizable Principle-Following Reward Models☆25Updated last month
- This is the code of MMOA-RAG.☆60Updated 2 months ago
- GitHub repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)☆35Updated 2 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated 4 months ago
- AdaMoLE: Adaptive Mixture of LoRA Experts☆33Updated 9 months ago
- ☆48Updated last month
- ☆48Updated this week
- AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆73Updated last month
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆23Updated last year