The-Swarm-Corporation/Mamba-R1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/The-Swarm-Corporation/Mamba-R1)

The-Swarm-Corporation / Mamba-R1

Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Experts (MoE).

☆25

Alternatives and similar repositories for Mamba-R1

Users that are interested in Mamba-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

The-Swarm-Corporation / Brainwave
View on GitHub
Brainwave is a state-of-the-art neural decoder that transforms electroencephalogram (EEG) and brain signals into multimodal outputs inclu…
☆14Oct 6, 2025Updated 9 months ago
The-Swarm-Corporation / HTX-Swarm
View on GitHub
A sophisticated multi-agent system designed for real-time market analysis of HTX (formerly Huobi) exchange data. This swarm combines spec…
☆11Mar 18, 2025Updated last year
The-Swarm-Corporation / AgentGym
View on GitHub
A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1
☆24Oct 13, 2025Updated 9 months ago
The-Swarm-Corporation / Brain2Qwerty
View on GitHub
An implementation of the paper Brain2Qwerty that translates brain EEG data into text for reading people's brains. There was no code so we…
☆25Feb 9, 2025Updated last year
kyegomez / dev-swarm
View on GitHub
A swarm of LLM agents that will help you test, document, and productionize your code!
☆19Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
longrongyang / STGC
View on GitHub
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13Feb 11, 2025Updated last year
kyegomez / MoE-Mamba
View on GitHub
Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…
☆132Updated this week
Bigyehahaha / M4
View on GitHub
The code of 《M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis》
☆14Mar 31, 2025Updated last year
The-Swarm-Corporation / SwarmOS
View on GitHub
Traditional operating systems are reactive - they wait for user input or system events before taking action. SwarmOS breaks this paradigm…
☆15Dec 6, 2024Updated last year
kyegomez / FastFF
View on GitHub
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16Nov 11, 2024Updated last year
zdebruine / MMVAE
View on GitHub
Mixture-of-Experts Multimodal Variational Autoencoder
☆15Jul 3, 2025Updated last year
The-Swarm-Corporation / AgentOS
View on GitHub
AgentOS is a lightweight, single-file implementation that provides a robust foundation for building autonomous AI agents. It implements t…
☆25Jul 11, 2025Updated last year
The-Swarm-Corporation / swarm-models
View on GitHub
A simple to use package to call various model providers such as openai, anthropic, and others with utmost reliability, security, and perf…
☆13Oct 6, 2025Updated 9 months ago
kuanhenglin / ddim-inversion
View on GitHub
UCLA CS 188 (Winter 2023) course project.
☆12Mar 31, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kyegomez / SoundStream
View on GitHub
Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"
☆13Jan 27, 2025Updated last year
kyegomez / TTL
View on GitHub
Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"
☆23Updated this week
he-h / ST-MoE-BERT
View on GitHub
This repository contains the code for the paper "ST-MoE-BERT: A Spatial-Temporal Mixture-of-Experts Framework for Long-Term Cross-City Mo…
☆16Feb 20, 2025Updated last year
mr-azharul / nestjs-boilerplate
View on GitHub
A scalable, clean architecture, ready-to-use NestJs boilerplate
☆11May 9, 2024Updated 2 years ago
kyegomez / Pairformer
View on GitHub
Implementation of the Pairformer model used in AlphaFold 3
☆14Updated this week
ChenZiHong-Gavin / MoE-Visualizer
View on GitHub
MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.
☆16Apr 8, 2025Updated last year
cfcooney / Imagined-Speech-EEG-Matlab
View on GitHub
Data extraction and processing of EEG trials corresponding to imagined speech. The dataset has been acquired from: http://www.cs.toronto.…
☆10Mar 10, 2018Updated 8 years ago
ThomasRochefortB / torch-gato
View on GitHub
Pytorch implementation of the Gato paper from Deepmind
☆12Feb 8, 2023Updated 3 years ago
Taishi-N324 / Drop-Upcycling
View on GitHub
[ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization
☆25Oct 5, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kyegomez / SimpleMamba
View on GitHub
Implementation of a modular, high-performance, and simplistic mamba for high-speed applications
☆41Nov 11, 2024Updated last year
kyegomez / OmniByteFormer
View on GitHub
OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…
☆15Updated this week
kyegomez / TinyGPTV
View on GitHub
Simple Implementation of TinyGPTV in super simple Zeta lego blocks
☆16Nov 11, 2024Updated last year
LAION-AI / worldsim
View on GitHub
☆13Aug 29, 2023Updated 2 years ago
kyegomez / OpenR1
View on GitHub
An open source implementation of R1
☆30Updated this week
ttw1018 / MoPE-DST
View on GitHub
The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"
☆19Jan 25, 2025Updated last year
kyegomez / AutoRT
View on GitHub
Implementation of AutoRT: "AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents"
☆44Nov 11, 2024Updated last year
matthewrice345 / Flutter-Architecture-Demo
View on GitHub
A sample app to demonstrate a scalable architecture in Flutter
☆10Aug 26, 2019Updated 6 years ago
The-Swarm-Corporation / HospitalSim
View on GitHub
HospitalSim is a sophisticated multi-agent hospital management and simulation system designed to optimize healthcare operations through m…
☆29Jan 19, 2026Updated 6 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Prismadic / simple-CMamba
View on GitHub
An unofficial, simple pytorch implementation of the paper "C-Mamba: Channel Correlation Enhanced State Space Models for Multivariate Time…
☆18Jul 19, 2024Updated 2 years ago
kyegomez / TeraGPT
View on GitHub
Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT
☆17Updated this week
lucidrains / grokfast-pytorch
View on GitHub
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆104Dec 22, 2024Updated last year
7tl7qns7ch / IPOT
View on GitHub
Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEs (AAAI 2024)
☆14Jul 30, 2024Updated last year
joshnuss / threlte-starter
View on GitHub
Starter template for SvelteKit + Threlte projects
☆11Dec 27, 2022Updated 3 years ago
caojiaolong / Awesome-Mamba
View on GitHub
Collect papers about Mamba (a selective state space model).
☆15Aug 6, 2024Updated last year
kyegomez / CogNetX
View on GitHub
CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…
☆20Updated this week