Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks"
☆20Mar 28, 2026Updated 2 weeks ago
Alternatives and similar repositories for MambaFormer
Users that are interested in MambaFormer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year
- A vast array of Multi-Modal Embodied Robotic Foundation Models!☆28Mar 18, 2024Updated 2 years ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆125Updated this week
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆126Mar 22, 2026Updated 3 weeks ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆27Mar 30, 2026Updated 2 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of AutoRT: "AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents"☆43Nov 11, 2024Updated last year
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 6 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Nov 11, 2024Updated last year
- A simpler Pytorch + Zeta Implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series…☆29Nov 11, 2024Updated last year
- iDeepV: predicting RBP binding sites using vector representation learned from sequences with a CNN.☆10Jul 16, 2019Updated 6 years ago
- The real GPT-4 with image access (You probably don't have access)☆12Mar 17, 2023Updated 3 years ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Apr 6, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A Hierarchical Approach for Generating Descriptive Image Paragraphs☆10Mar 27, 2020Updated 6 years ago
- ☆18Apr 2, 2023Updated 3 years ago
- Minimal Mamba-2 implementation in PyTorch☆247Jun 17, 2024Updated last year
- This repository contains all DeepCLIP Python code☆11May 30, 2024Updated last year
- This is my paper.☆11Jun 24, 2023Updated 2 years ago
- Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…☆17Sep 25, 2025Updated 6 months ago
- Implementation of "Do As I Can, Not As I Say: Grounding Language in Robotic Affordances" by Google☆24Mar 22, 2026Updated 3 weeks ago
- [NeurIPS 2023] Neural Lyapunov Control for Discrete-Time Systems☆14Jul 29, 2024Updated last year
- Jax code for functional genomics ML☆14Mar 5, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.☆17Nov 19, 2024Updated last year
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆46May 23, 2023Updated 2 years ago
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆27Oct 13, 2025Updated 6 months ago
- minimalistic AI library that resembles HF's transformers☆13Dec 31, 2024Updated last year
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆218Mar 22, 2026Updated 3 weeks ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Mar 13, 2026Updated last month
- This is the official implementation of RL-Chord (TNNLS).☆13Jan 2, 2024Updated 2 years ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Jan 21, 2025Updated last year
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆209Mar 13, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆56Oct 27, 2025Updated 5 months ago
- Implementation of PyTorch: "GAMBA: MARRY GAUSSIAN SPLATTING WITH MAMBA FOR SINGLE-VIEW 3D RECONSTRUCTION"☆65Oct 6, 2025Updated 6 months ago
- Encryption and signing for a post quantum world☆17Apr 4, 2023Updated 3 years ago
- time-series☆17Jan 6, 2024Updated 2 years ago
- ChatEMG: Synthetic Data Generation to Control a Robotic Hand Orthosis for Stroke☆10Jul 2, 2024Updated last year
- A LeapJS compatible server for Ultraleap Gemini V5 and above☆14Feb 19, 2025Updated last year
- Pytorch (Lightning) implementation of the Mamba model☆37Apr 18, 2025Updated 11 months ago