The official implementation of Bi-Mamba
☆15 · Oct 22, 2025 · Updated 5 months ago
Alternatives and similar repositories for Bi-Mamba
Users that are interested in Bi-Mamba are comparing it to the libraries listed below.
- Official Pytorch Implementation of Paper "DarwinLM: Evolutionary Structured Pruning of Large Language Models" ☆20 · Feb 21, 2025 · Updated last year
- Elucidated Dataset Condensation (NeurIPS 2024) ☆20 · Oct 5, 2024 · Updated last year
- (CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA… ☆27 · Aug 23, 2025 · Updated 7 months ago
- [NeurIPS 2025] The first web-based benchmark and platform to evaluate visual reasoning and interaction capabilities of MLLM powered agent… ☆61 · Feb 19, 2026 · Updated 2 months ago
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation ☆51 · Aug 24, 2025 · Updated 7 months ago
- ☆53 · Jan 31, 2026 · Updated 2 months ago
- [ICLR2025] γ-MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models ☆43 · Oct 28, 2025 · Updated 5 months ago
- ☆12 · Oct 17, 2023 · Updated 2 years ago
- Ludo game made with jquery ☆11 · Jul 19, 2023 · Updated 2 years ago
- pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsup… ☆18 · Mar 23, 2020 · Updated 6 years ago
- [ACL2026 Main] AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts ☆73 · Jan 23, 2026 · Updated 2 months ago
- An implementation of LazyLLM token pruning for LLaMa 2 model family. ☆13 · Jan 6, 2025 · Updated last year
- ☆12 · Aug 22, 2023 · Updated 2 years ago
- A small game in react.js ☆15 · Oct 8, 2018 · Updated 7 years ago
- Generative Models for Low Rank Video Representation and Reconstruction ☆10 · May 20, 2019 · Updated 6 years ago
- Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models" ☆15 · May 10, 2024 · Updated last year
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer" ☆66 · Sep 6, 2022 · Updated 3 years ago
- ☆15 · Mar 18, 2025 · Updated last year
- PyTorch implementation of the "Learning an Adaptive Learning Rate Schedule" paper found here: https://arxiv.org/abs/1909.09712. ☆12 · Jan 15, 2020 · Updated 6 years ago
- [NeurIPS25 & ICML25 Workshop on Reliable and Responsible Foundation Models] A Simple Baseline Achieving Over 90% Success Rate Against the… ☆91 · Feb 3, 2026 · Updated 2 months ago
- PCA-SVD-Autoencoder-Fourier-Wavelet-Transformation-for-denoising ☆22 · Feb 16, 2022 · Updated 4 years ago
- This repo attempts to reproduce DSOD: Learning Deeply Supervised Object Detectors from Scratch using a Gluon reimplementation ☆14 · Aug 18, 2018 · Updated 7 years ago
- Softened ROSA QKV Operators for Training Next-Generation LLM Models ☆36 · Apr 7, 2026 · Updated last week
- The official implementation of "DDR-Net: Learning Multi-Stage Multi-View Stereo With Dynamic Depth Range" ☆24 · Aug 25, 2021 · Updated 4 years ago
- Code for reproducing the results in "Forecasting Human Dynamics from Static Images" ☆13 · Jun 16, 2024 · Updated last year
- عمق ☆16 · May 7, 2015 · Updated 10 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023) ☆34 · Feb 5, 2023 · Updated 3 years ago
- [ICLR 2026] Optimization-free Dataset Distillation for Object Detection. Paper at: https://arxiv.org/abs/2506.01942 ☆29 · Jan 26, 2026 · Updated 2 months ago
- a language ☆17 · Jun 17, 2022 · Updated 3 years ago
- YouTube Assistant ☆12 · May 15, 2023 · Updated 2 years ago
- Paper, dataset and code list for multimodal dialogue. ☆22 · Jan 2, 2025 · Updated last year
- ☆26 · Mar 20, 2024 · Updated 2 years ago
- MICRO 2024 Evaluation Artifact for FuseMax ☆17 · Aug 26, 2024 · Updated last year
- Implementation of Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning in Chisel HDL. To know more, … ☆17 · Oct 9, 2021 · Updated 4 years ago
- i-mae Pytorch Repo ☆20 · Apr 6, 2024 · Updated 2 years ago
- [ACL 2026] Psyche-R1 (Chinese Psychological Reasoning LLM) ☆25 · Updated this week
- [NeurIPS 2024] DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning ☆55 · Nov 9, 2025 · Updated 5 months ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- See https://github.com/cuda-mode/triton-index/ instead! ☆11 · May 8, 2024 · Updated last year