☆29Oct 9, 2024Updated last year
Alternatives and similar repositories for Multi_Head_Mixture_of_Experts__MH-MOE
Users that are interested in Multi_Head_Mixture_of_Experts__MH-MOE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Mar 1, 2025Updated last year
- ☆20Nov 5, 2024Updated last year
- Extremely simple MoE implementation, mostly based off Switch Transformer☆13Feb 26, 2024Updated 2 years ago
- [ICCV 2025] EA-ViT: Efficient Adaptation for Elastic Vision Transformer☆27Jul 28, 2025Updated 10 months ago
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆31May 11, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICCV'2025] LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation☆53Mar 8, 2026Updated 3 months ago
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago
- Code for the ICML 2025 Paper "Product of Experts with LLMs: Boosting Performance on ARC is a Matter of Perspective"☆54Nov 9, 2025Updated 7 months ago
- This is the official implemantation of “Learn-to-Decompose: Cascaded Decomposition Network for Cross-Domain Few-Shot Facial Expression Re…☆18Aug 5, 2022Updated 3 years ago
- Dataset, code and results repository for SBA-Net.☆14Sep 23, 2022Updated 3 years ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆62Nov 5, 2025Updated 7 months ago
- Formal implementation of Robust Domain Misinformation Detection via Multi-modal Feature Alignment☆12Dec 8, 2023Updated 2 years ago
- ☆17Feb 23, 2025Updated last year
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆13Dec 1, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆14Mar 6, 2022Updated 4 years ago
- Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".☆19Oct 30, 2024Updated last year
- CVPR2024 highlight.☆13Oct 10, 2024Updated last year
- 图像检索一些好的开源代码☆14Sep 3, 2020Updated 5 years ago
- Official implementation of "ImagineFSL: Self-Supervised Pretraining Matters on Imagined Base Set for VLM-based Few-shot Learning" [CVPR 2…☆29Sep 1, 2025Updated 9 months ago
- ☆22Jul 16, 2024Updated last year
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- Code of paper "EDMB: Edge Detector with Mamba"☆18May 29, 2026Updated last week
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation☆35Jul 18, 2025Updated 10 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [IROS 2024] Official code for Towards Dynamic and Small Objects Refinement for Unsupervised Domain Adaptative Nighttime Segmentation.☆16Jun 4, 2024Updated 2 years ago
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated 3 months ago
- ☆10Feb 21, 2023Updated 3 years ago
- [ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron☆31Apr 30, 2025Updated last year
- [CVPR 2025] This repository is intended to store the code and data for ASAP (Advancing Semantic Alignment Promotes Multi-Modal Manipulati…☆20Jun 18, 2025Updated 11 months ago
- the implement for "Multi-modal Fake News Detection on Social Media via Multi-grained Information Fusion"☆21Jun 18, 2024Updated last year
- ☆14Jan 22, 2025Updated last year
- A toolset and pipeline for running zero shot and supervised protein fitness prediction, drop in compatible with scikitlearn☆13May 26, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Converts AlphaFold distograms into distance matrices and saves them into a number of formats☆15Dec 13, 2022Updated 3 years ago
- Implementation for Context-Gated Convolution☆60Nov 5, 2021Updated 4 years ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆18Jul 21, 2024Updated last year
- ☆57Mar 17, 2025Updated last year
- ☆10Jun 10, 2023Updated 2 years ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆39Jun 11, 2025Updated 11 months ago
- Pytorch Framework learning for deeplearning☆14Jan 2, 2024Updated 2 years ago