THUDM / MoELoRA_Riemannian
Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models.
☆16Updated last month
Alternatives and similar repositories for MoELoRA_Riemannian:
Users who are interested in MoELoRA_Riemannian are comparing it to the libraries listed below
- A Self-Training Framework for Vision-Language Reasoning☆73Updated 2 months ago
- Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal …☆46Updated last month
- An Easy-to-use Hallucination Detection Framework for LLMs.☆58Updated 11 months ago
- A benchmark for evaluating the capabilities of large vision-language models (LVLMs)☆46Updated last year
- ☆64Updated 9 months ago
- ☆54Updated 5 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆115Updated 4 months ago
- The code and data of DPA-RAG☆58Updated 2 months ago
- ☆28Updated 5 months ago
- A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond☆42Updated this week
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆70Updated 4 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆56Updated last month
- ☆70Updated 2 months ago
- The demo, code and data of FollowRAG☆70Updated 3 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆45Updated 5 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆33Updated 2 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆103Updated 2 weeks ago
- ☆35Updated 3 weeks ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆104Updated 5 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆97Updated last month
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆67Updated this week
- The code of arxiv paper: "CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis"☆23Updated 2 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 4 months ago
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆19Updated last month
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) 🍓☆28Updated 2 weeks ago
- Official code of paper "Speculative Ensemble: Fast Large Language Model Ensemble via Speculation"☆12Updated last month
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆79Updated last year
- ☆30Updated last week
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆43Updated 3 months ago
- ☆15Updated last month