yifanzhang-pro/deep-delta-learning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yifanzhang-pro/deep-delta-learning)

yifanzhang-pro / deep-delta-learning

Official Project Page for Deep Delta Learning (https://arxiv.org/abs/2601.00417)

☆356

Alternatives and similar repositories for deep-delta-learning

Users that are interested in deep-delta-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pi-apps / pi-sdk-rails
View on GitHub
☆19Mar 19, 2026Updated 4 months ago
lcqysl / DiffThinker
View on GitHub
[ICML 2026] Official repo for "DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models"
☆186Jan 4, 2026Updated 6 months ago
greltel / abap-point-gate
View on GitHub
ABAP Cloud-ready framework for isolating ABAP exit points.
☆32Mar 31, 2026Updated 3 months ago
fsmunoz / datastar-cl
View on GitHub
Datastar Common Lisp SDK
☆62Jun 28, 2026Updated 3 weeks ago
wdlctc / delta-attention-residuals-code
View on GitHub
Delta Attention Residuals - supplementary code and pretrained models
☆40May 20, 2026Updated 2 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
rodmena-limited / stabilize
View on GitHub
Queue-Based State Machine - A lightweight workflow execution engine with DAG-based stage orchestration. Unlike simple task queues (like …
☆86Jul 5, 2026Updated 3 weeks ago
SakanaAI / drq
View on GitHub
Digital Red Queen: Adversarial Program Evolution in Core War with LLMs
☆214Jan 13, 2026Updated 6 months ago
YannDubs / Mini_Decodable_Information_Bottleneck
View on GitHub
Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation.
☆12Oct 20, 2020Updated 5 years ago
Yifei-Zuo / Parallax
View on GitHub
Official repository for Parallax (Parameterized Local Linear Attention)
☆65Jul 7, 2026Updated 2 weeks ago
fla-org / flame
View on GitHub
🔥 A minimal training framework for scaling FLA models
☆403Apr 22, 2026Updated 3 months ago
locuslab / EqR
View on GitHub
[ICML 2026] Code for Equilibrium Reasoners: learning attractor dynamics for scalable reasoning
☆45Jun 1, 2026Updated last month
model-architectures / GRAPE
View on GitHub
[ICLR 2026] GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805)
☆115Jun 15, 2026Updated last month
MoonshotAI / Kimi-Linear
View on GitHub
☆1,491Nov 17, 2025Updated 8 months ago
test-time-training / e2e
View on GitHub
Official JAX implementation of End-to-End Test-Time Training for Long Context
☆626Feb 15, 2026Updated 5 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
facebookresearch / PhysicsLM4
View on GitHub
Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality
☆356May 20, 2026Updated 2 months ago
tilde-research / wall-attention-release
View on GitHub
Attention variant with per-channel multiplicative decay
☆48Jun 3, 2026Updated last month
peterpaohuang / flux_matching
View on GitHub
Generative Modeling with Flux Matching
☆65May 11, 2026Updated 2 months ago
naklecha / simple-llm
View on GitHub
~950 line, minimal, extensible LLM inference engine built from scratch.
☆478Jan 9, 2026Updated 6 months ago
lucidrains / metacontroller
View on GitHub
Implementation of the MetaController proposed in "Emergent temporal abstractions in autoregressive models enable hierarchical reinforceme…
☆106Jul 8, 2026Updated 2 weeks ago
tokenbender / mHC-manifold-constrained-hyper-connections
View on GitHub
implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880
☆369Feb 17, 2026Updated 5 months ago
pipethedev / distributed-system-algorithms
View on GitHub
☆270Jan 3, 2026Updated 6 months ago
BryceZhuo / HybridNorm
View on GitHub
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19Mar 7, 2025Updated last year
modula-systems / modula
View on GitHub
🧱 Modula software package
☆337Aug 18, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mmaaz-git / pdlp
View on GitHub
PDLP algorithm for linear programming
☆98Dec 31, 2025Updated 6 months ago
alexiglad / EBT
View on GitHub
PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning
☆639Apr 21, 2026Updated 3 months ago
Sphere-AI-Lab / fda
View on GitHub
Implementation of <Model Merging with Functional Dual Anchors>
☆46Nov 23, 2025Updated 8 months ago
allenai / bolmo-core
View on GitHub
Code for Bolmo: Byteifying the Next Generation of Language Models
☆136Jul 6, 2026Updated 2 weeks ago
test-time-training / discover
View on GitHub
☆611May 24, 2026Updated 2 months ago
MaitySubhajit / KArAt
View on GitHub
Kolmogorov-Arnold Attention: Is Learnable Attention Better for Vision Transformers?
☆16Jul 9, 2025Updated last year
aHapBean / xHC
View on GitHub
[Tech Report] Expanded Hyper-Connections
☆49Updated this week
ZihaoHuang-notabot / ConceptMoE
View on GitHub
☆45Jan 30, 2026Updated 5 months ago
WeichenFan / UAE
View on GitHub
Official repo for UAE
☆207Jun 21, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
RiddleHe / nanochat
View on GitHub
The best ChatGPT that $100 can buy.
☆54Updated this week
openai / circuit_sparsity
View on GitHub
Open-source release accompanying Gao et al. 2025
☆530Dec 11, 2025Updated 7 months ago
ASTRAL-Group / LoRe
View on GitHub
When Reasoning Meets Its Laws
☆38Jan 2, 2026Updated 6 months ago
deepseek-ai / Engram
View on GitHub
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
☆4,561Jan 14, 2026Updated 6 months ago
olivkoch / TinyRecursiveModels
View on GitHub
☆35Nov 11, 2025Updated 8 months ago
mit-zardini-lab / pyncd
View on GitHub
☆68Apr 8, 2026Updated 3 months ago
goombalab / Gather-and-Aggregate
View on GitHub
Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"
☆16Apr 30, 2025Updated last year