yifanzhang-pro / deep-delta-learningLinks
Official Project Page for Deep Delta Learning (https://huggingface.co/papers/2601.00417)
☆317Updated last week
Alternatives and similar repositories for deep-delta-learning
Users that are interested in deep-delta-learning are comparing it to the libraries listed below
Sorting:
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆168Updated 5 months ago
- RLP: Reinforcement as a Pretraining Objective☆223Updated 3 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆126Updated 7 months ago
- Code for Bolmo: Byteifying the Next Generation of Language Models☆115Updated last month
- Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"☆103Updated this week
- Official JAX implementation of End-to-End Test-Time Training for Long Context☆478Updated last week
- ☆158Updated 3 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆38Updated 2 months ago
- ☆50Updated 4 months ago
- This is the official Python version of Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.☆110Updated 3 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆251Updated 2 months ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆736Updated last month
- Official implementation of "Continuous Autoregressive Language Models"☆714Updated last month
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆260Updated last week
- ☆93Updated this week
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆288Updated 2 months ago
- ☆19Updated 10 months ago
- ☆106Updated 6 months ago
- Repo for "Adaptation of Agentic AI"☆572Updated this week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆358Updated 7 months ago
- This repository collects and organises state‑of‑the‑art papers on spatial reasoning for Multimodal Vision–Language Models (MVLMs).☆266Updated last week
- ☆185Updated 6 months ago
- Large multi-modal models (L3M) pre-training.☆229Updated 4 months ago
- Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding☆193Updated 2 weeks ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆102Updated 4 months ago
- Demystifying Reinforcement Learning in Agentic Reasoning☆155Updated 3 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆306Updated last month
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov…☆588Updated last week
- The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"☆320Updated last week
- SSRL: Self-Search Reinforcement Learning☆205Updated 5 months ago