☆50Jan 28, 2025Updated last year
Alternatives and similar repositories for Mixture-of-Mamba
Users that are interested in Mixture-of-Mamba are comparing it to the libraries listed below.
- [ICML 2025] Adaptive Self-improvement LLM Agentic System for ML Library Development☆17Jan 6, 2026Updated 5 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 4 months ago
- ☆19Nov 4, 2025Updated 7 months ago
- LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation☆28Oct 18, 2024Updated last year
- [ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆53Nov 20, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- Histomic Prognostic Signature (HiPS): A population-level computational histologic signature for invasive breast cancer prognosis☆33Apr 9, 2024Updated 2 years ago
- Differential equation neural operator☆22Sep 4, 2023Updated 2 years ago
- Dynamic config system based on python classes☆12Jan 27, 2023Updated 3 years ago
- ActSort is an active learning accelerated cell sorter tool for calcium imaging.☆26Apr 27, 2026Updated last month
- Boosting Multi-view Stereo with Late Cost Aggregation☆13Jan 24, 2024Updated 2 years ago
- A collection of resources and information for concrete skills that are helpful when pursuing a PhD in computer science (specifically in M…☆23Apr 18, 2023Updated 3 years ago
- Simple repository for training small reasoning models☆52Feb 17, 2026Updated 3 months ago
- #UAI2020 Codes for PAC-Bayesian Contrastive Unsupervised Representation Learning☆14May 23, 2022Updated 4 years ago
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- [CVPR 2025] 2DMamba: Efficient State Space Model for Image Representation☆84Jan 29, 2026Updated 4 months ago
- Decoding of the speech envelope from EEG using the VLAAI deep neural network☆14Sep 28, 2022Updated 3 years ago
- This is the official repository for "Multi-modal Gated Mixture of Local-to-Global Experts for Dynamic Image Fusion" (ICCV 2023).☆75Apr 30, 2024Updated 2 years ago
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".☆18Apr 25, 2025Updated last year
- ☆21Apr 30, 2023Updated 3 years ago
- XmodelLM☆38Nov 19, 2024Updated last year
- An open-source implementation for fine-tuning DINOv2 by Meta.☆14Jul 21, 2025Updated 10 months ago
- A hackable, simple, and research-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 9 months ago
- PyTorch Implementation of "BOOTPLACE: Bootstrapped Object Placement with Detection Transformers", CVPR 2025☆28May 18, 2026Updated 3 weeks ago
- Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.☆114Jan 14, 2026Updated 4 months ago
- ☆45Mar 31, 2025Updated last year
- Segment Anything with Webcam in Real-Time with FastSAM☆10Nov 19, 2023Updated 2 years ago
- Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers" [ICCV 2025]☆104Jul 28, 2025Updated 10 months ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆24Oct 13, 2025Updated 7 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆73May 18, 2025Updated last year
- A mini-redis learned from tokio.☆12Dec 20, 2022Updated 3 years ago
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning☆47Apr 21, 2025Updated last year
- PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models☆14Jul 20, 2023Updated 2 years ago
- ☆61May 13, 2025Updated last year
- Code for "StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model", AAAI2026 Oral☆55Jan 16, 2026Updated 4 months ago
- Efficient Computation and Analysis of Distributional Shapley Values (AISTATS 2021)☆22Oct 19, 2023Updated 2 years ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆86Mar 21, 2024Updated 2 years ago
- ☆14Dec 12, 2024Updated last year
- ☆17Feb 23, 2025Updated last year