Some preliminary explorations of Mamba's context scaling.
☆13Dec 18, 2024Updated last year
Alternatives and similar repositories for LongMamba
Users that are interested in LongMamba are comparing it to the libraries listed below
- GoldFinch and other hybrid transformer components☆12Dec 9, 2025Updated 2 months ago
- GoldFinch and other hybrid transformer components☆45Jul 20, 2024Updated last year
- ☆27Updated this week
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated last year
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆31Jan 28, 2026Updated last month
- RWKV-RAG Personal Edition☆26Aug 6, 2025Updated 6 months ago
- We have implemented a framework that helps developers apply structured pruning to neural networks in TensorFlow models☆28Nov 7, 2024Updated last year
- Fast modular code to create and train cutting edge LLMs☆68May 16, 2024Updated last year
- OAuth authentication plugin for personal coding assistance with ChatGPT Plus/Pro subscriptions - uses OpenAI's official authentication me…☆23Feb 3, 2026Updated last month
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆39Mar 11, 2024Updated last year
- Test images with inappropriate labels in the MNIST dataset☆10Mar 3, 2018Updated 8 years ago
- density growing clustering☆10Dec 2, 2021Updated 4 years ago
- ☆12Feb 24, 2026Updated last week
- Android test project displaying live camera feed in a GLSurfaceView☆10Mar 8, 2015Updated 10 years ago
- Cherry Flowers everywhere☆11Jul 19, 2024Updated last year
- A modern-style viewer for Danbooru and other Booru API-based sites.☆13May 23, 2025Updated 9 months ago
- A convenient primitive for creating, structuring and throwing errors☆13Oct 26, 2025Updated 4 months ago
- Code uploaded while following The Math of Intelligence course by Siraj Raval on YouTube☆11Jul 15, 2019Updated 6 years ago
- Transform the collected configurations into various target configurations, inheriting the art of simplifying complexity. Additionally, it…☆14May 3, 2025Updated 10 months ago
- ☆14Jan 24, 2025Updated last year
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Mar 20, 2025Updated 11 months ago
- ☆13May 21, 2023Updated 2 years ago
- ME-GraphAU on Video☆11May 10, 2024Updated last year
- UnitEval is a benchmarking and evaluation tool for AutoDev Coder.☆13Jan 2, 2024Updated 2 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- EXL2 quantization generalized to other models.☆10Mar 17, 2024Updated last year
- a Nix-based C preprocessor☆16Aug 11, 2024Updated last year
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 10 months ago
- RWKV Wiki website (archived, please visit official wiki)☆11Mar 26, 2023Updated 2 years ago
- A desktop video live-wallpaper engine implemented in Python☆10Jun 2, 2022Updated 3 years ago
- Remake DarkColony (1997) on OpenRA platform☆13Mar 23, 2022Updated 3 years ago
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- ☆17May 17, 2024Updated last year
- ☆12Dec 14, 2024Updated last year
- LongAttn: Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- Hacks helping with semi-almost-usable declarative NixOS sandboxing☆12Aug 14, 2024Updated last year
- my nix packages☆10Feb 19, 2026Updated last week
- Reinforcement Learning Toolkit for RWKV (v6, v7, ARWKV). Distillation, SFT, RLHF (DPO, ORPO), infinite context training, aligning. Exploring the…☆62Sep 19, 2025Updated 5 months ago