ZihaoHuang-notabot/Ultra-Sparse-Memory-Network

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZihaoHuang-notabot/Ultra-Sparse-Memory-Network)

ZihaoHuang-notabot / Ultra-Sparse-Memory-Network

☆48

Alternatives and similar repositories for Ultra-Sparse-Memory-Network

Users that are interested in Ultra-Sparse-Memory-Network are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SakanaAI / fast-weight-product-key-memory
View on GitHub
Code for Fast-weight Product Key Memory (FwPKM)
☆19Mar 18, 2026Updated 4 months ago
sail-sg / SkyLadder
View on GitHub
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆43Dec 29, 2025Updated 7 months ago
assafbk / OPRM
View on GitHub
Overflow Prevention Enhances Long-Context Recurrent LLMs (COLM 2025)
☆18Jul 8, 2025Updated last year
bcml-labs / rosa-plus
View on GitHub
ROSA+: RWKV's ROSA implementation with fallback statistical predictor
☆36Oct 13, 2025Updated 9 months ago
wz1119 / KromHC
View on GitHub
[ICML 2026] Implementation for KromHC: Manifold-Constrained Hyper-Connections with Kronecker-Product Residual Matrices
☆15Jul 13, 2026Updated 2 weeks ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
hanningzhang / ER-PRM
View on GitHub
☆20Dec 14, 2024Updated last year
goombalab / Gather-and-Aggregate
View on GitHub
Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"
☆16Apr 30, 2025Updated last year
thunlp / SparsingLaw
View on GitHub
The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".
☆32Nov 12, 2024Updated last year
chen-hao-chao / mdm-prime-v2
View on GitHub
MDM-Prime-v2: Binary Encoding and Index Shuffling Enable Scaling of Diffusion Language Models
☆27May 23, 2026Updated 2 months ago
wjie98 / rosa_soft
View on GitHub
Softened ROSA QKV Operators for Training Next-Generation LLM Models
☆39Jun 26, 2026Updated last month
vincentamato / mlx-esm-2
View on GitHub
An MLX implementation of Meta AI's ESM-2 protein language model
☆16Aug 16, 2025Updated 11 months ago
longrongyang / STGC
View on GitHub
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13Feb 11, 2025Updated last year
sustcsonglin / fla-tilelang
View on GitHub
☆37Mar 7, 2025Updated last year
HazyResearch / scaling-verification
View on GitHub
☆26Sep 4, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Zyphra / zcookbook
View on GitHub
Training hybrid models for dummies.
☆31Nov 1, 2025Updated 8 months ago
yunxiangfu2001 / LaMamba-Diff
View on GitHub
LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba (Official Implementation)
☆17Oct 24, 2024Updated last year
dayal-kalra / low-memory-adam
View on GitHub
☆14Mar 2, 2025Updated last year
tilde-research / nsa-release
View on GitHub
An efficient implementation of the NSA (Native Sparse Attention) kernel
☆133Jun 24, 2025Updated last year
liushulinle / UloRL
View on GitHub
An Ultra-Long Output Reinforcement Learning Approach
☆23Jul 31, 2025Updated 11 months ago
theAdamColton / ijepa-enhanced
View on GitHub
recipe for training fully-featured self supervised image jepa models
☆14Jun 4, 2025Updated last year
swairshah / Intensify
View on GitHub
coloring terminal text with intensities (used for plotting probability, entropy with tokens)
☆12Oct 11, 2024Updated last year
lucidrains / hyper-connections
View on GitHub
Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public
☆187May 13, 2026Updated 2 months ago
OliverSieberling / dynamic-conv1d
View on GitHub
Triton kernels for dynamic causal short convolutions.
☆24Jun 4, 2026Updated last month
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
HKUNLP / critic-rl
View on GitHub
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆127May 6, 2025Updated last year
dhcode-cpp / Engram-pytorch
View on GitHub
pytorch implementation of DeepSeek Engram
☆19Mar 24, 2026Updated 4 months ago
NX-AI / flashrnn
View on GitHub
FlashRNN - Fast RNN Kernels with I/O Awareness
☆188Jul 22, 2026Updated last week
goombalab / raven
View on GitHub
☆78May 29, 2026Updated 2 months ago
yushuiwx / MH-MoE
View on GitHub
☆20Nov 5, 2024Updated last year
YardenBakish / PE-AWARE-LRP
View on GitHub
Official PyTorch implementation for Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability [Neur…
☆17Jul 7, 2025Updated last year
zyaaa-ux / ROSA-Tuning
View on GitHub
ROSA-Tuning
☆74Feb 4, 2026Updated 5 months ago
csslc / Self-Transcendence
View on GitHub
[ECCV 2026] Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Trans…
☆37Jul 3, 2026Updated 3 weeks ago
Infini-AI-Lab / STEM
View on GitHub
☆66May 7, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Zyphra / tree_attention
View on GitHub
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
☆134Dec 3, 2024Updated last year
HanGuo97 / log-linear-attention
View on GitHub
☆284Jun 6, 2025Updated last year
UbiquantAI / URM
View on GitHub
Universal Reasoning Model
☆134Jan 15, 2026Updated 6 months ago
Alex-Gurung / ReasoningNCP
View on GitHub
Official repo for Learning to Reason for Long-Form Story Generation
☆78Apr 19, 2025Updated last year
airs-cuhk / airsoul
View on GitHub
Next-gen Foundation Model for Embodied AI
☆32Apr 7, 2026Updated 3 months ago
rioyokotalab / swallow-code-math
View on GitHub
Ongoing research project for code&math LLMs
☆32Jul 4, 2025Updated last year
aflah02 / TokenSmith
View on GitHub
A comprehensive toolkit for streamlining data editing, search, and inspection for large-scale language model training and interpretabilit…
☆21Oct 30, 2025Updated 8 months ago