Implementation of MambaFormer in PyTorch and Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks"
☆21Apr 20, 2026Updated 2 weeks ago
Alternatives and similar repositories for MambaFormer
Users who are interested in MambaFormer are comparing it to the libraries listed below.
- Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2☆15Jun 27, 2025Updated 10 months ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆126Apr 13, 2026Updated 3 weeks ago
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆127Apr 13, 2026Updated 2 weeks ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆28Apr 20, 2026Updated last week
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆232Oct 16, 2025Updated 6 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 6 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- A simpler Pytorch + Zeta Implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series…☆29Nov 11, 2024Updated last year
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆16Apr 13, 2026Updated 3 weeks ago
- A Hierarchical Approach for Generating Descriptive Image Paragraphs☆10Mar 27, 2020Updated 6 years ago
- Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…☆17Sep 25, 2025Updated 7 months ago
- Implementation of "Do As I Can, Not As I Say: Grounding Language in Robotic Affordances" by Google☆25Apr 13, 2026Updated 2 weeks ago
- (BMVC 2020 Oral) Neighbourhood-Insensitive Point Cloud Normal Estimation Network☆10Jun 30, 2025Updated 10 months ago
- A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.☆17Nov 19, 2024Updated last year
- minimalistic AI library that resembles HF's transformers☆13Dec 31, 2024Updated last year
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆220Apr 13, 2026Updated 3 weeks ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Apr 13, 2026Updated 3 weeks ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Jan 21, 2025Updated last year
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆214Apr 13, 2026Updated 3 weeks ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆56Oct 27, 2025Updated 6 months ago
- [CVPR 2019] Official Matlab implementation of OSD: Unsupervised image matching and object discovery as optimization.☆12Nov 4, 2021Updated 4 years ago
- PyTorch implementation of "GAMBA: MARRY GAUSSIAN SPLATTING WITH MAMBA FOR SINGLE-VIEW 3D RECONSTRUCTION"☆65Oct 6, 2025Updated 6 months ago
- time-series☆17Jan 6, 2024Updated 2 years ago
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"☆12Oct 20, 2024Updated last year
- Pytorch (Lightning) implementation of the Mamba model☆37Apr 18, 2025Updated last year
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,455Jan 26, 2026Updated 3 months ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆86Mar 21, 2024Updated 2 years ago
- MiranDa: Mimicking the learning process of human doctors to achieve causal inference for medication recommendation☆13Jan 19, 2026Updated 3 months ago
- An implementation of Anthropic's paper and essay on "A statistical approach to model evaluations"☆16Oct 6, 2025Updated 6 months ago
- [Neurips 2024] This repository is the official implementation of the Spatio-hemispherical equivariant convolution for dMRI deconvolution …☆11Dec 24, 2024Updated last year
- ☆14Jul 26, 2023Updated 2 years ago
- Harmonization of multi-site, multi-shell diffusion MRI☆14Oct 16, 2025Updated 6 months ago
- A public repository for ConDo (AAAI25 accepted)☆10Dec 21, 2024Updated last year
- Rust SDK and CLI for Swarm Framework with Multi-Agent Orchestration☆16Apr 10, 2025Updated last year
- Finetune any model on HF in less than 30 seconds☆57Updated this week
- ☆15Apr 26, 2025Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- ☆13Jul 23, 2024Updated last year