jzhang38 / LongMambaView external linksLinks
Some preliminary explorations of Mamba's context scaling.
☆218Feb 8, 2024Updated 2 years ago
Alternatives and similar repositories for LongMamba
Users that are interested in LongMamba are comparing it to the libraries listed below
Sorting:
- ☆35Feb 26, 2024Updated last year
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated last year
- ☆29May 4, 2024Updated last year
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆66Apr 24, 2024Updated last year
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"☆248Jun 6, 2025Updated 8 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents