Weixin-Liang/Mixture-of-Mamba

☆51

Alternatives and similar repositories for Mixture-of-Mamba

Users that are interested in Mixture-of-Mamba are comparing it to the libraries listed below.

ml-jku / plstm_experiments
☆16 · Oct 21, 2025 · Updated 8 months ago
zhang677 / PCL-lite
[ICML 2025] Adaptive Self-improvement LLM Agentic System for ML Library Development
☆17 · Jan 6, 2026 · Updated 6 months ago
microsoft / x-reasoner
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
☆49 · Feb 4, 2026 · Updated 5 months ago
YueZhan721 / MambaSOD
☆15 · Jun 22, 2026 · Updated 3 weeks ago
krafton-ai / lexico
KV cache compression via sparse coding
☆17 · Oct 26, 2025 · Updated 8 months ago
kiaia / GIRAFFE
Extending context length of visual language models
☆12 · Dec 18, 2024 · Updated last year
formll / resolving-scaling-law-discrepancies
☆19 · Nov 4, 2025 · Updated 8 months ago
MaitySubhajit / KArAt
Kolmogorov-Arnold Attention: Is Learnable Attention Better for Vision Transformers?
☆16 · Jul 9, 2025 · Updated last year
feizc / PNAIC
Partially Non-Autoregressive Image Captioning
☆10 · Sep 30, 2021 · Updated 4 years ago
sinahmr / LocAtViT
PyTorch Implementation of LocAtViT in "Locality-Attending Vision Transformer" (ICLR 2026)
☆18 · Mar 10, 2026 · Updated 4 months ago
Wuuu3511 / LAMVSNET
Boosting Multi-view Stereo with Late Cost Aggregation
☆13 · Jan 24, 2024 · Updated 2 years ago
mmcdermott / How-to-PhD
A collection of resources and information for concrete skills that are helpful when pursuing a PhD in computer science (specifically in M…
☆23 · Apr 18, 2023 · Updated 3 years ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆51 · Feb 17, 2026 · Updated 5 months ago
wangf3014 / Patch_Scaling
Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More
☆25 · Feb 25, 2025 · Updated last year
JayaniP / Multi_Agent-LLM
Enhancing Multi-Agent System Coordination in Autonomous Electric Vehicles Using Large Language Models
☆21 · Dec 13, 2023 · Updated 2 years ago
openmedlab / Data-Centric-FM-Healthcare
☆30 · Oct 8, 2024 · Updated last year
XiaoduoAILab / XmodelLM
XmodelLM
☆38 · Nov 19, 2024 · Updated last year
BryceZhuo / PolyCom
The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".
☆18 · Apr 25, 2025 · Updated last year
The-Swarm-Corporation / Mamba-R1
Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…
☆25 · Oct 13, 2025 · Updated 9 months ago
TianjinYellow / StableSPAM
☆28 · Jul 2, 2026 · Updated 2 weeks ago
ShaderManager / RetNet
PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models
☆14 · Jul 20, 2023 · Updated 3 years ago
Gen-Verse / Diffusion-Sharpening
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
☆72 · May 18, 2025 · Updated last year
abrvkh / explainability_toolkit
☆14 · Dec 12, 2024 · Updated last year
koalazf99 / nanoverl
Collections of RLxLM experiments using minimal codes
☆14 · Feb 17, 2025 · Updated last year
feizc / Vespa
Video Diffusion State Space Models
☆19 · Mar 27, 2024 · Updated 2 years ago
lok-18 / A2RNet
AAAI 2025 | A2RNet: Adversarial Attack Resilient Network for Robust Infrared and Visible Image Fusion
☆33 · Oct 10, 2025 · Updated 9 months ago
ZhengYu518 / VL-Mamba
Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"
☆86 · Mar 21, 2024 · Updated 2 years ago
2U1 / DINOv2-Finetune
An open-source implementation for fine-tuning DINOv2 by Meta.
☆15 · Jul 21, 2025 · Updated last year
BoXiao123 / simple-chinese-ocr-with-opencv
☆10 · Apr 8, 2018 · Updated 8 years ago
mikecvet / beam
LLM Beam Search Example Implementation
☆13 · May 3, 2024 · Updated 2 years ago
HUANGLIZI / MMFundus
This repository is the official data collection of MMFundus (Multimodal Fundus) dataset.
☆13 · Feb 2, 2026 · Updated 5 months ago
NX-AI / mlstm_kernels
Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.
☆90 · Jul 6, 2026 · Updated 2 weeks ago
LiaoEuan / SincAlignNet
This implementation is based on the SincAlignNet model from the paper 'Frequency-Based Alignment of EEG and Audio Signals Using Contrasti…
☆14 · Jul 28, 2025 · Updated 11 months ago
naver-ai / lut
[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"
☆14 · Dec 1, 2024 · Updated last year
amazon-science / object-centric-vol
☆13 · Apr 3, 2024 · Updated 2 years ago
sunblaze-ucb / reasoning_ladder
☆35 · May 16, 2025 · Updated last year
Zcchill / Value-Residual-Learning
☆15 · Mar 20, 2025 · Updated last year
rina-ding / gat-mamba
Combining Graph Neural Network and Mamba to Capture Local and Global Tissue Spatial Relationships in Whole Slide Images
☆37 · Jun 3, 2025 · Updated last year
SHI-Labs / VisPer-LM
[NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation
☆73 · Oct 17, 2025 · Updated 9 months ago