ByteDance-Seed / AHN
AHN: Artificial Hippocampus Networks for Efficient Long-Context Modeling
☆166Updated 3 months ago
Alternatives and similar repositories for AHN
Users who are interested in AHN are comparing it to the repositories listed below.
- ☆71Updated last week
- ☆82Updated 10 months ago
- Official Repository of Native Parallel Reasoner☆100Updated 3 weeks ago
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…☆89Updated last month
- ☆185Updated last year
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆191Updated 7 months ago
- Easy and Efficient dLLM Fine-Tuning☆209Updated 2 weeks ago
- The open-source code of MetaStone-S1.☆105Updated 6 months ago
- SCOPE: Self-evolving Context Optimization via Prompt Evolution - A framework for automatic prompt optimization☆64Updated last month
- [NeurIPS'25 Oral] Query-agnostic KV cache eviction: 3–4× reduction in memory and 2× decrease in latency (Qwen3/2.5, Gemma3, LLaMA3)☆196Updated 3 weeks ago
- Official code repository for Sketch-of-Thought (SoT)☆135Updated 9 months ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆251Updated 4 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆33Updated 5 months ago
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆331Updated 8 months ago
- A project for tri-modal LLM benchmarking and instruction tuning.☆56Updated 10 months ago
- ☆271Updated last week
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆90Updated 3 months ago
- ☆88Updated 8 months ago
- ☆84Updated 3 months ago
- [ICLR 2026] Geometric-Mean Policy Optimization☆99Updated 2 weeks ago
- GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…☆321Updated 2 months ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆121Updated 8 months ago
- ☆142Updated 3 weeks ago
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"☆136Updated 5 months ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆558Updated 3 months ago
- LIMI: Less is More for Agency☆160Updated 3 months ago
- Official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Updated last year
- ☆71Updated 3 months ago
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆104Updated last week
- ☆77Updated 9 months ago