PyTorch implementation of "From Sparse to Soft Mixtures of Experts"
☆71Aug 22, 2023Updated 2 years ago
Alternatives and similar repositories for soft-moe
Users that are interested in soft-moe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆83Oct 5, 2023Updated 2 years ago
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch☆347Apr 2, 2025Updated last year
- ☆717Dec 6, 2025Updated 5 months ago
- ☆23Oct 22, 2025Updated 7 months ago
- ☆17Jun 9, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 10 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆160Jul 9, 2025Updated 10 months ago
- ☆95Apr 3, 2023Updated 3 years ago
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch☆384Jun 17, 2024Updated last year
- Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024☆15Sep 30, 2024Updated last year
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago
- The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…☆14Feb 12, 2026Updated 3 months ago
- ☆17Jun 20, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PELA: Learning Parameter-Efficient Models with Low-Rank Approximation [CVPR 2024]☆19Apr 14, 2024Updated 2 years ago
- An unofficial implementation for paper "DenseCLIP: Extract Free Dense Labels from CLIP"☆24Jan 27, 2022Updated 4 years ago
- 队伍在2023年全国大学生数学建模竞赛中选择的C题目编程过程中使用的代码,现在开源提供给大家!☆12Jan 15, 2024Updated 2 years ago
- E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker☆55Apr 16, 2026Updated last month
- This is the offical repository for "Multi-modal Gated Mixture of Local-to-Global Experts for Dynamic Image Fusion" (ICCV 2023).☆73Apr 30, 2024Updated 2 years ago
- Learning generative models with Sinkhorn Loss☆31Nov 9, 2018Updated 7 years ago
- ☆21Oct 4, 2025Updated 7 months ago
- ☆23Mar 17, 2026Updated 2 months ago
- SegMamba-V2☆31Jun 30, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 3D-UMamba: 3D U-Net with state space model for semantic segmentation of multi-source LiDAR point clouds☆22Dec 12, 2024Updated last year
- 西电人工智能学院大二专业基础实践项目--高光谱图像目标检测☆12Jan 15, 2024Updated 2 years ago
- [NeurIPS 2024] Mixture of Experts for Audio-Visual Learning☆24Jan 19, 2025Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆240Dec 3, 2024Updated last year
- Related papers about Referring Image Segmentation (RIS)☆16Dec 26, 2023Updated 2 years ago
- [CVPR 2025] CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answeri…☆56Jun 16, 2025Updated 11 months ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆22Jul 20, 2024Updated last year
- This is a SPM12 batch script that runs a standard fMRI preprocessing pipeline on a BIDS formatted data-set.☆13Nov 19, 2020Updated 5 years ago
- SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentation☆11Jul 31, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆19Oct 18, 2025Updated 7 months ago
- Streaming Thinking for VideoLLM Streaming Video Understanding☆102Updated this week
- Recent Advances in Vision-Language Pre-training!☆32Jan 10, 2022Updated 4 years ago
- Code for "Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured …☆17Oct 15, 2023Updated 2 years ago
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆57Jul 9, 2024Updated last year
- An awesome list that curates the best Flet tools, tutorials, blogs and more.☆10Jan 8, 2023Updated 3 years ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year