The code of Advancing Expert Specialization for Better MoE (NeurIPS2025 oral)
☆32Jan 22, 2026Updated 4 months ago
Alternatives and similar repositories for Auxloss-For-Advancing-Expert-Specialization
Users that are interested in Auxloss-For-Advancing-Expert-Specialization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch Implementation of "Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching"☆33Mar 1, 2026Updated 3 months ago
- [CVPR2026] VOSR: A Vision-Only Generative Model for Image Super-Resolution☆125Apr 12, 2026Updated 2 months ago
- [ICLR 2026 Oral] Reasoning as Representation: Rethinking Visual Reinforcement Learning in Image Quality Assessment☆35Feb 14, 2026Updated 3 months ago
- ☆17Sep 9, 2024Updated last year
- Official PyTorch implementation of "Cross-Domain Ensemble Distillation for Domain Generalization" (ECCV 2022)☆25Dec 11, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The code for KoMA☆20Jun 23, 2025Updated 11 months ago
- ☆11Jul 15, 2021Updated 4 years ago
- Curated LLM (ICML 2024)☆14Oct 23, 2024Updated last year
- Multi-Teacher Knowledge Distillation, code for my PhD dissertation. I used knowledge distillation as a decision-fusion and compressing m…☆28May 19, 2023Updated 3 years ago
- The Matlab implementation of the 5 point fundamental matrix estimator. If you use this work for Academic purposes, please cite Barath, D.…☆15Feb 26, 2019Updated 7 years ago
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- ☆21Jul 1, 2024Updated last year
- OCR Engine☆17Dec 31, 2021Updated 4 years ago
- A comprehensive evaluation framework for the SEA region☆29Apr 20, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Decision Transformer for offline single-agent autonomous highway driving☆28Jun 19, 2023Updated 2 years ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 7 months ago
- ☆14Jul 13, 2025Updated 10 months ago
- An End-to-End Benchmarking Framework for Retrieval-Augmented Generation Systems☆29Mar 13, 2026Updated 3 months ago
- CoRL 2025☆49Sep 20, 2025Updated 8 months ago
- [ICML 2025] Retraining-Free Merging of Sparse MoE via Hierarchical Clustering☆26Oct 26, 2025Updated 7 months ago
- A collection of GPU experiments and benchmarks for my personal understanding and research.☆30Apr 9, 2026Updated 2 months ago
- Example of applying CUDA graphs to LLaMA-v2☆11Aug 25, 2023Updated 2 years ago
- Pokémon damage calculator☆14Feb 7, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆46Jun 24, 2025Updated 11 months ago
- An open source implementation of R1☆31May 18, 2026Updated 3 weeks ago
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆11Feb 13, 2024Updated 2 years ago
- ☆28Apr 7, 2026Updated 2 months ago
- ☆20Oct 31, 2022Updated 3 years ago
- [ICML 2025 Oral] Mixture of Lookup Experts☆74Dec 3, 2025Updated 6 months ago
- ☆13Mar 15, 2022Updated 4 years ago
- Compact and Agent-Native MoE Training System☆144Jun 5, 2026Updated last week
- ☆11Dec 15, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- nonebot2插件,戳一戳回复☆20Jun 1, 2026Updated last week
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 10 months ago
- A pytorch implementation of focal loss☆10Jan 9, 2020Updated 6 years ago
- Repository for the 2023 WACV paper: "Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization"☆12Dec 21, 2022Updated 3 years ago
- ☆12Jun 15, 2023Updated 2 years ago
- (TPAMI 2026) Learning Continuous Wasserstein Barycenter Space for Generalized All-in-One Image Restoration☆198Apr 23, 2026Updated last month
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year