maidacundo / MoE-LoRA
Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.
☆33Updated 6 months ago
Alternatives and similar repositories for MoE-LoRA:
Users that are interested in MoE-LoRA are comparing it to the libraries listed below
- [SIGIR'24] The official implementation code of MOELoRA.☆160Updated 9 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆118Updated 5 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆78Updated 5 months ago
- ☆81Updated last year
- The code of arxiv paper: "CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis"☆24Updated 3 months ago
- ☆132Updated 9 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆148Updated 7 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆124Updated 10 months ago
- ☆49Updated last year
- ☆140Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆80Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆80Updated last year
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆46Updated last month
- A framework for editing the CoTs for better factuality☆51Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆114Updated 7 months ago
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener…☆60Updated 9 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆80Updated 2 months ago
- A Chinese National Medical Licensing Examination dataset and large languge model benchmarks☆60Updated last year
- ☆46Updated 10 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆139Updated 10 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆65Updated 4 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 10 months ago
- this is an implementation for the paper Improve Mathematical Reasoning in Language Models by Automated Process Supervision from google de…☆28Updated 3 weeks ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆32Updated 4 months ago
- SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL☆16Updated 2 weeks ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆60Updated 3 months ago
- The demo, code and data of FollowRAG☆72Updated this week
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…☆32Updated 4 months ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆43Updated last year
- self-adaptive in-context learning☆43Updated last year