The official implementation of Bi-Mamba
☆14Oct 22, 2025Updated 5 months ago
Alternatives and similar repositories for Bi-Mamba
Users that are interested in Bi-Mamba are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Pytorch Implementation of Paper "DarwinLM: Evolutionary Structured Pruning of Large Language Models"☆20Feb 21, 2025Updated last year
- Elucidated Dataset Condensation (NeurIPS 2024)☆20Oct 5, 2024Updated last year
- (CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA…☆27Aug 23, 2025Updated 7 months ago
- [NeurIPS 2025] The first web-based benchmark and platform to evaluate visual reasoning and interaction capabilities of MLLM powered agent…☆60Feb 19, 2026Updated last month
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation☆51Aug 24, 2025Updated 7 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆51Jan 31, 2026Updated last month
- [ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆43Oct 28, 2025Updated 5 months ago
- ☆12Oct 17, 2023Updated 2 years ago
- pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsup…☆18Mar 23, 2020Updated 6 years ago
- AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts☆69Jan 23, 2026Updated 2 months ago
- An implementation of LazyLLM token pruning for LLaMa 2 model family.☆13Jan 6, 2025Updated last year
- ☆12Aug 22, 2023Updated 2 years ago
- Generative Models for Low Rank Video Representation and Reconstruction☆10May 20, 2019Updated 6 years ago
- Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆15May 10, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"☆66Sep 6, 2022Updated 3 years ago
- ☆15Mar 18, 2025Updated last year
- PyTorch implementation of the "Learning an Adaptive Learning Rate Schedule" paper found here: https://arxiv.org/abs/1909.09712.☆12Jan 15, 2020Updated 6 years ago
- [NeurIPS25 & ICML25 Workshop on Reliable and Responsible Foundation Models] A Simple Baseline Achieving Over 90% Success Rate Against the…☆90Feb 3, 2026Updated last month
- PCA-SVD-Autoencoder-Fourier-Wavelet-Transformation-for-denoising☆22Feb 16, 2022Updated 4 years ago
- this repo attemps to reproduce DSOD: Learning Deeply Supervised Object Detectors from Scratch use gluon reimplementation☆14Aug 18, 2018Updated 7 years ago
- The official implementation of "DDR-Net: Learning Multi-Stage Multi-View Stereo With Dynamic Depth Range"☆24Aug 25, 2021Updated 4 years ago
- Code for reproducing the results in "Forecasting Human Dynamics from Static Images"☆13Jun 16, 2024Updated last year
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆34Feb 5, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- عمق☆16May 7, 2015Updated 10 years ago
- MICRO 2024 Evaluation Artifact for FuseMax☆16Aug 26, 2024Updated last year
- [ICLR 2026] Optimization-free Dataset Distillation for Object Detection. Paper at: https://arxiv.org/abs/2506.01942☆28Jan 26, 2026Updated 2 months ago
- a language☆17Jun 17, 2022Updated 3 years ago
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- Paper, dataset and code list for multimodal dialogue.☆22Jan 2, 2025Updated last year
- ☆26Mar 20, 2024Updated 2 years ago
- Psyche-R1 (Chinese Psychological Reasoning LLM)☆25Oct 17, 2025Updated 5 months ago
- Implementation of Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning in Chisel HDL. To know more, …☆17Oct 9, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [NeurIPS 2024] DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning☆54Nov 9, 2025Updated 4 months ago
- i-mae Pytorch Repo☆20Apr 6, 2024Updated last year
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- ☆14Mar 8, 2025Updated last year
- The implementation of our paper accepted by ACL 2023: Facilitating Multi-turn Emotional Support Conversation with Positive Emotion Elicit…☆23Jul 16, 2023Updated 2 years ago
- Re:Phiedit 指南:通过重构 RPE 说明书结构,优化阅读体验,减少 RPE 说明书的理解成本☆16Apr 16, 2023Updated 2 years ago