PyTorch implementation of LIMoE
☆52Apr 1, 2024Updated 2 years ago
Alternatives and similar repositories for LIMoE-pytorch
Users that are interested in LIMoE-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆37May 12, 2026Updated 2 weeks ago
- SWIS: Self-Supervised Representation Learning For Writer Independent Offline Signature Verification", ICIP 2022 (Oral)☆11Feb 17, 2023Updated 3 years ago
- ContextBLIP : Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions [ACL 2024]☆11May 17, 2024Updated 2 years ago
- A collection of AWESOME things about mixture-of-experts☆1,282Dec 8, 2024Updated last year
- Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆40Oct 31, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆717Dec 6, 2025Updated 5 months ago
- Build a Recurrent Neural Network solving Optimization Problems☆10Nov 17, 2021Updated 4 years ago
- [ICCV-2023] Heterogeneous Forgetting Compensation for Class-Incremental Learning☆12Dec 4, 2023Updated 2 years ago
- Implementation for MomentumSMoE☆19Apr 19, 2025Updated last year
- The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…☆14Feb 12, 2026Updated 3 months ago
- PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538☆1,243Apr 19, 2024Updated 2 years ago
- Implementation of AAAI 2022 Paper: Go wider instead of deeper☆32Oct 27, 2022Updated 3 years ago
- Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning☆26Nov 15, 2023Updated 2 years ago
- Knowledge-Guided Adaptation of Pathology Foundation Models Improves Cross-domain Generalization and Demographic Fairness☆17Oct 14, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆56Feb 28, 2023Updated 3 years ago
- [NeurIPS 2022] “M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design”, Hanxue …☆136Nov 30, 2022Updated 3 years ago
- ☆12Jun 21, 2022Updated 3 years ago
- Official Implementation for MoPE (T-MM 2025)☆29Oct 10, 2025Updated 7 months ago
- A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models☆859Sep 13, 2023Updated 2 years ago
- ☆10Mar 4, 2024Updated 2 years ago
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…☆17May 27, 2024Updated 2 years ago
- ☆23Jun 25, 2021Updated 4 years ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Jun 13, 2019Updated 6 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆12Oct 31, 2024Updated last year
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Feb 15, 2023Updated 3 years ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated last year
- Find ground breaking 3D point cloud analysis papers☆13Jul 28, 2020Updated 5 years ago
- ☆26Jun 14, 2022Updated 3 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆37Feb 10, 2026Updated 3 months ago
- Vehicle registration plate recognition using convolutional neural networks☆11Nov 30, 2022Updated 3 years ago
- ACL 2024 (SRW), Official Codebase of our Paper: "MoExtend: Tuning New Experts for Modality and Task Extension"☆15Dec 3, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆15Jul 9, 2024Updated last year
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- [ECCV'24] cDP-MIL: Robust Multiple Instance Learning via Cascaded Dirichlet Process☆17Sep 10, 2024Updated last year
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆86Nov 12, 2024Updated last year
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆34Feb 22, 2026Updated 3 months ago
- PyTorch implementation of StackGAN paper using BERT embeddings☆12Feb 6, 2022Updated 4 years ago
- code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)☆10Jul 15, 2022Updated 3 years ago