The code of Advancing Expert Specialization for Better MoE (NeurIPS2025 oral)
☆31Jan 22, 2026Updated 4 months ago
Alternatives and similar repositories for Auxloss-For-Advancing-Expert-Specialization
Users that are interested in Auxloss-For-Advancing-Expert-Specialization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch Implementation of "Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching"☆31Mar 1, 2026Updated 2 months ago
- [CVPR2026] VOSR: A Vision-Only Generative Model for Image Super-Resolution☆107Apr 12, 2026Updated last month
- [ICLR 2026 Oral] Reasoning as Representation: Rethinking Visual Reinforcement Learning in Image Quality Assessment☆31Feb 14, 2026Updated 3 months ago
- ☆17Sep 9, 2024Updated last year
- Official PyTorch implementation of "Cross-Domain Ensemble Distillation for Domain Generalization" (ECCV 2022)☆25Dec 11, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The code for KoMA☆20Jun 23, 2025Updated 11 months ago
- ☆11Jul 15, 2021Updated 4 years ago
- Curated LLM (ICML 2024)☆14Oct 23, 2024Updated last year
- Multi-Teacher Knowledge Distillation, code for my PhD dissertation. I used knowledge distillation as a decision-fusion and compressing m…☆28May 19, 2023Updated 3 years ago
- The Matlab implementation of the 5 point fundamental matrix estimator. If you use this work for Academic purposes, please cite Barath, D.…☆15Feb 26, 2019Updated 7 years ago
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- ☆21Jul 1, 2024Updated last year
- OCR Engine☆17Dec 31, 2021Updated 4 years ago
- A comprehensive evaluation framework for the SEA region☆27Apr 20, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Decision Transformer for offline single-agent autonomous highway driving☆28Jun 19, 2023Updated 2 years ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 6 months ago
- ☆14Jul 13, 2025Updated 10 months ago
- CoRL 2025☆48Sep 20, 2025Updated 8 months ago
- [ICML 2025] Retraining-Free Merging of Sparse MoE via Hierarchical Clustering☆25Oct 26, 2025Updated 6 months ago
- A collection of GPU experiments and benchmarks for my personal understanding and research.☆30Apr 9, 2026Updated last month
- Example of applying CUDA graphs to LLaMA-v2☆11Aug 25, 2023Updated 2 years ago
- Pokémon damage calculator☆14Feb 7, 2024Updated 2 years ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆45Jun 24, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- (TPAMI 2026) Learning Continuous Wasserstein Barycenter Space for Generalized All-in-One Image Restoration☆177Apr 23, 2026Updated last month
- An open source implementation of R1☆31Updated this week
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆11Feb 13, 2024Updated 2 years ago
- ☆27Apr 7, 2026Updated last month
- ☆20Oct 31, 2022Updated 3 years ago
- [ICML 2025 Oral] Mixture of Lookup Experts☆72Dec 3, 2025Updated 5 months ago
- ☆13Mar 15, 2022Updated 4 years ago
- ☆11Dec 15, 2025Updated 5 months ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- nonebot2插件,戳一戳回复☆20May 4, 2026Updated 2 weeks ago
- A pytorch implementation of focal loss☆10Jan 9, 2020Updated 6 years ago
- Repository for the 2023 WACV paper: "Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization"☆12Dec 21, 2022Updated 3 years ago
- ☆12Jun 15, 2023Updated 2 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- ☆74Oct 10, 2025Updated 7 months ago
- ☆46Sep 13, 2025Updated 8 months ago