This repository has been transferred to https://github.com/TUDB-Labs/MoE-PEFT
☆22 · Aug 16, 2024 · Updated last year
Alternatives and similar repositories for mLoRA
Users who are interested in mLoRA are comparing it to the libraries listed below.
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method ☆205 · Aug 22, 2024 · Updated last year
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT ☆139 · Mar 11, 2025 · Updated last year
- The predecessor of CiteLab. ☆18 · Feb 3, 2026 · Updated 3 months ago
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models ☆19 · Oct 24, 2024 · Updated last year
- Code and data for QueryAgent (ACL 2024) ☆20 · Dec 19, 2024 · Updated last year
- GPTuner is a manual-reading database tuning system leveraging domain knowledge automatically and extensively to enhance knob tuning proces… ☆125 · Jul 3, 2025 · Updated 10 months ago
- ☆13 · Jun 11, 2024 · Updated last year
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment ☆404 · Apr 29, 2024 · Updated 2 years ago
- Just for debug ☆57 · Feb 15, 2024 · Updated 2 years ago
- Unsupervised tableQA and databaseQA on Chinese finance question and tabular data ☆13 · Apr 20, 2023 · Updated 3 years ago
- ☆12 · Jun 5, 2024 · Updated last year
- Code for the arXiv paper: Complex Claim Verification with Evidence Retrieved in the Wild ☆13 · Nov 27, 2023 · Updated 2 years ago
- ☆25 · Jan 1, 2025 · Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. ☆12 · Feb 11, 2024 · Updated 2 years ago
- AIDE: the Machine Learning CodeGen Agent ☆25 · Oct 7, 2024 · Updated last year
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304 ☆14 · Oct 11, 2022 · Updated 3 years ago
- ☆10 · Mar 4, 2024 · Updated 2 years ago
- [CIKM2023] HyperFormer: Enhancing Entity and Relation Interaction for Hyper-Relational Knowledge Graph Completion ☆17 · Nov 26, 2024 · Updated last year
- ☆11 · Dec 19, 2024 · Updated last year
- Code-Style In-Context Learning for Knowledge-Based Question Answering ☆14 · Mar 3, 2024 · Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆11 · Sep 4, 2025 · Updated 8 months ago
- [Recsys'2023] "RCL: Multi-Relational Contrastive Learning for Recommendation" ☆17 · Sep 6, 2023 · Updated 2 years ago
- This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models" ☆31 · Oct 8, 2023 · Updated 2 years ago
- Source code of our MM24 paper "Harmfully Manipulated Images Matter in Multimodal Misinformation Detection" ☆19 · Aug 10, 2025 · Updated 9 months ago
- ☆23 · Jan 6, 2021 · Updated 5 years ago
- ☆16 · Apr 29, 2025 · Updated last year
- Code and data for GMT-KBQA ☆17 · Jan 5, 2023 · Updated 3 years ago
- ☆12 · Aug 3, 2024 · Updated last year
- ☆11 · Aug 24, 2023 · Updated 2 years ago
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning ☆31 · Jul 14, 2025 · Updated 10 months ago
- [Paper][CCKS2023] CausE: Towards Causal Knowledge Graph Embedding ☆17 · Jul 30, 2023 · Updated 2 years ago
- ☆30 · Oct 23, 2023 · Updated 2 years ago
- Data preprocessing for fine-tuning Qwen models ☆13 · Jan 8, 2024 · Updated 2 years ago
- Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning ☆60 · Feb 24, 2026 · Updated 3 months ago
- [NAACL 2024] Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers https://arxiv.org/abs/2307.… ☆17 · Jan 27, 2024 · Updated 2 years ago
- Code for Estimating Multi-cause Treatment Effects via Single-cause Perturbation (NeurIPS 2021) ☆14 · Jan 5, 2022 · Updated 4 years ago
- This is the source code for Data-free Backdoor. Our paper is accepted by the 32nd USENIX Security Symposium (USENIX Security 2023). ☆33 · Sep 23, 2023 · Updated 2 years ago
- ☆10 · Dec 21, 2024 · Updated last year
- ☆12 · Nov 1, 2019 · Updated 6 years ago