Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)
☆39Apr 2, 2025Updated last year
Alternatives and similar repositories for MoELoRA_Riemannian
Users that are interested in MoELoRA_Riemannian are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Oct 22, 2025Updated 7 months ago
- The repo for HiRA paper☆38Jan 9, 2026Updated 5 months ago
- ☆126Jul 6, 2024Updated last year
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated 2 years ago
- LLM-Powered Data Discovery System for Tabular Data☆32Apr 7, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆68Apr 3, 2026Updated 2 months ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)☆52Apr 14, 2025Updated last year
- This repository contains the implementation for our work "TopoDiffusionNet: A Topology-aware Diffusion Model", accepted to ICLR 2025.☆26Apr 17, 2025Updated last year
- ☆17Mar 10, 2025Updated last year
- Official Implementation of "IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models"☆18Jun 5, 2025Updated last year
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- [AAAI 2024] SiMA-Hand: Boosting 3D Hand-Mesh Reconstruction by Single-to-Multi-view Adaptation, Pytorch implementation.☆11Feb 6, 2024Updated 2 years ago
- [NeurIPS 2025 Spotlight] Official PyTorch implementation of Vgent☆48Nov 30, 2025Updated 6 months ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆11Nov 19, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code and data for QueryAgent(ACL 2024)☆20Dec 19, 2024Updated last year
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆19Dec 17, 2025Updated 6 months ago
- ☆14May 4, 2024Updated 2 years ago
- ☆20Aug 14, 2025Updated 10 months ago
- Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation, AVDN Challenge, ICCV CLVL 2023.☆21Jan 2, 2024Updated 2 years ago
- IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)☆17Nov 15, 2024Updated last year
- A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.☆10Nov 6, 2021Updated 4 years ago
- [SIGIR'24] The official implementation code of MOELoRA.☆193Jul 22, 2024Updated last year
- Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning☆28Oct 30, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"☆21Feb 29, 2024Updated 2 years ago
- ☆21Mar 19, 2025Updated last year
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆141Mar 11, 2025Updated last year
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆19Apr 23, 2026Updated last month
- ☆13May 6, 2025Updated last year
- [IEEE TMI 2024] Prototype-Guided Graph Reasoning Network for Few-Shot Medical Image Segmentation☆13Jun 13, 2025Updated last year
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆50Apr 22, 2026Updated last month
- [ ICLR 2025 ] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts☆32Oct 9, 2025Updated 8 months ago
- QuantClaw is a plug-and-play task-type routing quantization plugin for OpenClaw.☆116Apr 27, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for paper <SAR: Spatial-Aware Regression for 3D Hand Pose and Mesh Reconstruction from a Monocular RGB Image>. Accepted by ISMAR 202…☆11Aug 19, 2021Updated 4 years ago
- Invariant Feature Regularization for Fair Face Recognition (ICCV'23)☆15Oct 23, 2023Updated 2 years ago
- AdaMoLE: Adaptive Mixture of LoRA Experts☆38Oct 11, 2024Updated last year
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "☆150Apr 8, 2025Updated last year
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆37Nov 17, 2024Updated last year
- 使用django对情感分析功能进行封装,里面包含使用情感词典和Bert模型进行情感分类,最后可以使用tensorFlow serving将模型部署在docker中运行。☆12Sep 23, 2019Updated 6 years ago
- [CVPR 2025 Highlight] Official code for paper "Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-G…☆60Jun 5, 2025Updated last year