[ ICLR 2025 ] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts
☆32Oct 9, 2025Updated 8 months ago
Alternatives and similar repositories for HMoRA
Users that are interested in HMoRA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the NeurIPS 2018 paper "On Controllable Sparse Alternatives to Softmax"☆24Oct 10, 2019Updated 6 years ago
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆22Oct 15, 2024Updated last year
- 6 DoF Directional Room Impulse Response (RIR) with Dense Loudspeaker Grid☆17Aug 31, 2023Updated 2 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆21Nov 3, 2025Updated 7 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆403Apr 29, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AdaMoLE: Adaptive Mixture of LoRA Experts☆38Oct 11, 2024Updated last year
- [ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.☆115Dec 20, 2024Updated last year
- ☆20Feb 2, 2026Updated 4 months ago
- [ACL 2026 Main] Analytical FFN-to-MoE Restructuring via Activation Pattern Analysis☆42Apr 24, 2026Updated last month
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆40Jan 13, 2025Updated last year
- A Maximal Mutual Information Criterion for Manipulation Concept Discovery☆13Sep 26, 2024Updated last year
- ☆13Feb 17, 2025Updated last year
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Jun 13, 2024Updated 2 years ago
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆206Aug 22, 2024Updated last year
- Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471☆21Jun 19, 2024Updated last year
- The purpose of this code base is to add a specified signal-to-noise ratio noise from MUSAN dataset to a pure speech signal and to generat…☆31Sep 21, 2021Updated 4 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆19Dec 17, 2025Updated 5 months ago
- ☆14May 4, 2024Updated 2 years ago
- [ACM Multimedia 2021] Spatiotemporal Inconsistency Learning for DeepFake Video Detection☆11Jul 13, 2023Updated 2 years ago
- CUDA solutions for the lab assignments in the UIUC-ECE408 Applied Parallel Programming course.☆20Apr 18, 2023Updated 3 years ago
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆23Jun 10, 2024Updated 2 years ago
- [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆90Oct 15, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Pytorch implementation of MDensenet and sparse NMF. Made for my undergraduate thesis "Music Source Separation with Supervised Learning Me…☆11Jan 31, 2021Updated 5 years ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆40Oct 15, 2025Updated 8 months ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation☆16Mar 31, 2026Updated 2 months ago
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated 2 years ago
- Sound field reconstruction using neural processes with dynamic kernels☆16Mar 25, 2025Updated last year
- code for the paper "ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts" (CVPR 2022)☆10Jul 17, 2022Updated 3 years ago
- ☆23Jan 7, 2025Updated last year
- ☆18Mar 9, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning☆28Jul 4, 2025Updated 11 months ago
- ☆15Mar 18, 2025Updated last year
- Multimodal datasets.☆34Jan 26, 2024Updated 2 years ago
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference☆59Nov 20, 2024Updated last year
- KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024☆16Jul 29, 2024Updated last year
- Chinese Characters Visualization & Chinese Text Augmentation.☆17Sep 19, 2022Updated 3 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago