PyTorch implementation of LIMoE
☆52Apr 1, 2024Updated 2 years ago
Alternatives and similar repositories for LIMoE-pytorch
Users that are interested in LIMoE-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆37Apr 13, 2026Updated 3 weeks ago
- ContextBLIP : Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions [ACL 2024]☆11May 17, 2024Updated last year
- A collection of AWESOME things about mixture-of-experts☆1,275Dec 8, 2024Updated last year
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 5 months ago
- Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆40Oct 31, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆717Dec 6, 2025Updated 5 months ago
- Build a Recurrent Neural Network solving Optimization Problems☆10Nov 17, 2021Updated 4 years ago
- Implementation for MomentumSMoE☆19Apr 19, 2025Updated last year
- [ICLR 2025] CAMEx: Curvature-Aware Merging of Experts☆24Mar 1, 2025Updated last year
- SegEval is a Python library that provides tools for evaluating semantic segmentation models☆14Jun 29, 2023Updated 2 years ago
- This repo contains the code of "Contrastive Supervised Distillation for Continual Representation Learning", Tommaso Barletti, Niccolò Bio…☆20Jul 5, 2022Updated 3 years ago
- PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538☆1,242Apr 19, 2024Updated 2 years ago
- Histopathology Feature Extractors (2024)☆14Jun 14, 2024Updated last year
- Implementation of AAAI 2022 Paper: Go wider instead of deeper☆32Oct 27, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆13Sep 18, 2019Updated 6 years ago
- ☆95Apr 3, 2023Updated 3 years ago
- Knowledge-Guided Adaptation of Pathology Foundation Models Improves Cross-domain Generalization and Demographic Fairness☆17Oct 14, 2025Updated 6 months ago
- Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning☆26Nov 15, 2023Updated 2 years ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆56Feb 28, 2023Updated 3 years ago
- [NeurIPS 2022] “M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design”, Hanxue …☆136Nov 30, 2022Updated 3 years ago
- ☆12Jun 21, 2022Updated 3 years ago
- Official Implementation for MoPE (T-MM 2025)☆28Oct 10, 2025Updated 7 months ago
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch☆383Jun 17, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models☆859Sep 13, 2023Updated 2 years ago
- ☆10Mar 4, 2024Updated 2 years ago
- This is the official implementation of the IndexNet.☆11May 23, 2023Updated 2 years ago
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…☆17May 27, 2024Updated last year
- Official code for the ICLR 2025 paper, "Ada-K Routing: Boosting the Efficiency of MoE-based LLMs"☆12Mar 1, 2025Updated last year
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Jun 13, 2019Updated 6 years ago
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Feb 15, 2023Updated 3 years ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated last year
- Find ground breaking 3D point cloud analysis papers☆13Jul 28, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆26Jun 14, 2022Updated 3 years ago
- Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference☆13Jun 7, 2025Updated 11 months ago
- Use Siamese Network to implement fingerprint verification task.☆11Oct 21, 2021Updated 4 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- ☆14Jul 4, 2022Updated 3 years ago
- ☆12Apr 25, 2022Updated 4 years ago
- ACL 2024 (SRW), Official Codebase of our Paper: "MoExtend: Tuning New Experts for Modality and Task Extension"☆15Dec 3, 2024Updated last year