☆95Apr 3, 2023Updated 3 years ago
Alternatives and similar repositories for Mod-Squad
Users that are interested in Mod-Squad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2023 Findings] Emergent Modularity in Pre-trained Transformers☆26Jun 7, 2023Updated 2 years ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆70Aug 22, 2023Updated 2 years ago
- code for paper Sparse Structure Search for Delta Tuning☆11Oct 16, 2022Updated 3 years ago
- An up-to-date list of works on Multi-Task Learning☆378Mar 2, 2026Updated 2 months ago
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"☆19Feb 1, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NeurIPS 2022] “M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design”, Hanxue …☆136Nov 30, 2022Updated 3 years ago
- Mixture of Attention Heads☆52Oct 10, 2022Updated 3 years ago
- HD-EPIC Python script to download the entire datasets or parts of it☆20Oct 7, 2025Updated 6 months ago
- PyTorch implementation of LIMoE☆52Apr 1, 2024Updated 2 years ago
- ☆19Nov 5, 2024Updated last year
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch☆382Jun 17, 2024Updated last year
- The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…☆14Feb 12, 2026Updated 2 months ago
- Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)☆14Jan 8, 2024Updated 2 years ago
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆47Jul 4, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A fast MoE impl for PyTorch☆1,849Feb 10, 2025Updated last year
- Visual Representation Learning Benchmark for Self-Supervised Models☆35Apr 18, 2024Updated 2 years ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆56Feb 28, 2023Updated 3 years ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆83Oct 5, 2023Updated 2 years ago
- ☆22Aug 5, 2025Updated 8 months ago
- ☆44Apr 16, 2026Updated 2 weeks ago
- [ICCV 2023 oral] This is the official repository for our paper: ''Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning''.☆75Sep 24, 2023Updated 2 years ago
- ☆277Oct 31, 2023Updated 2 years ago
- Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs☆23Nov 11, 2025Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Pytorch implementation for Egoinstructor at CVPR 2024☆28Dec 1, 2024Updated last year
- The code source of MambaCSR☆27Sep 24, 2024Updated last year
- ☆27Mar 20, 2023Updated 3 years ago
- ☆289Aug 14, 2025Updated 8 months ago
- Implementation of "Towards Understanding Mixture of Experts in Deep Learning", NeurIPS 2022☆10Jan 6, 2023Updated 3 years ago
- ☆23May 8, 2025Updated 11 months ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29May 27, 2024Updated last year
- Official code for the paper DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools☆33Feb 25, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538☆1,242Apr 19, 2024Updated 2 years ago
- Official Implementation (Pytorch) of "Super-class guided Transformer for Zero-Shot Attribute Classification", AAAI 2025☆15Jan 15, 2025Updated last year
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆48Jun 2, 2025Updated 11 months ago
- [CVPR2025] Official repository for "VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide"☆28May 27, 2025Updated 11 months ago
- 【AAAI 2026】GenVidBench: A 6-Million Benchmark for AI-Generated Video Detection☆80Mar 13, 2026Updated last month
- Steering LLM Thinking with Budget Guidance☆30Feb 19, 2026Updated 2 months ago
- ☆29Jun 27, 2024Updated last year