SakanaAI/doc-to-lora

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SakanaAI/doc-to-lora)

SakanaAI / doc-to-lora

Hypernetworks that update LLMs to remember factual information

☆779

Alternatives and similar repositories for doc-to-lora

Users that are interested in doc-to-lora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SakanaAI / text-to-lora
View on GitHub
Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
☆1,293Jun 8, 2025Updated last year
MuLabPKU / SHINE
View on GitHub
The repo for SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass
☆90May 23, 2026Updated last month
SakanaAI / fast-weight-product-key-memory
View on GitHub
Code for Fast-weight Product Key Memory (FwPKM)
☆19Mar 18, 2026Updated 3 months ago
NVlabs / LoRWeB
View on GitHub
We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enablin…
☆74Jun 22, 2026Updated 3 weeks ago
liangbingzhao / PhysicEdit
View on GitHub
[ICML2026] From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors
☆92Apr 30, 2026Updated 2 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
alexzhang13 / rlm
View on GitHub
General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.
☆5,260Jun 26, 2026Updated 3 weeks ago
SakanaAI / sparser-faster-llms
View on GitHub
Cuda kernels for leveraging LLM sparsity to improve throughput and decrease the memory requirements during inference and training.
☆253Jun 29, 2026Updated 2 weeks ago
SakanaAI / DiffusionBlocks
View on GitHub
DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation
☆241Feb 18, 2026Updated 4 months ago
ysharma3501 / LavaSR
View on GitHub
🌋LavaSR: Fast Speech restoration and enhancement
☆562Jun 19, 2026Updated 3 weeks ago
lasgroup / SDPO
View on GitHub
Reinforcement Learning via Self-Distillation (SDPO)
☆1,011Jul 1, 2026Updated 2 weeks ago
ypwang61 / ThetaEvolve
View on GitHub
ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput…
☆169Feb 27, 2026Updated 4 months ago
Gen-Verse / OpenClaw-RL
View on GitHub
OpenClaw-RL: Train any agent simply by talking
☆5,573May 23, 2026Updated last month
test-time-training / e2e
View on GitHub
Official JAX implementation of End-to-End Test-Time Training for Long Context
☆624Feb 15, 2026Updated 5 months ago
gepa-ai / gepa
View on GitHub
Optimize prompts, code, and more with AI-powered Reflective Text Evolution
☆5,660Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
xk-huang / VecGlypher
View on GitHub
[CVPR'26] VecGlypher: Unified Vector Glyph Generation with Language Models
☆135Feb 26, 2026Updated 4 months ago
yangdongchao / UniAudio2Demo
View on GitHub
☆26Feb 10, 2026Updated 5 months ago
huawei-bayerlab / windowseat-reflection-removal
View on GitHub
Reflection Removal through Efficient Adaptation of Diffusion Transformers
☆131Apr 21, 2026Updated 2 months ago
hanjq17 / Spectrum
View on GitHub
[CVPR 2026] Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration
☆126Apr 30, 2026Updated 2 months ago
felixtaubner / mvp4d
View on GitHub
Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"
☆43Mar 24, 2026Updated 3 months ago
metauto-ai / NeuralComputer
View on GitHub
🖥 Neural Computers' Data Engine
☆200May 19, 2026Updated last month
TIGER-AI-Lab / OpenResearcher
View on GitHub
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
☆888Jun 10, 2026Updated last month
emrecanacikgoz / Tool-R0
View on GitHub
☆35Apr 3, 2026Updated 3 months ago
hustvl / MoDA
View on GitHub
An hardware-aware Efficient Implementation for "Mixture-of-Depths Attention".
☆273May 6, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Tencent-Hunyuan / HY-WU
View on GitHub
HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing
☆295Mar 18, 2026Updated 3 months ago
g-luo / generative_latent_prior
View on GitHub
Official PyTorch Implementation for Learning a Generative Meta-Model of LLM Activations, ICML 2026
☆90Apr 30, 2026Updated 2 months ago
WeiboAI / VibeThinker
View on GitHub
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
☆1,479Jun 17, 2026Updated last month
huggingface / feel
View on GitHub
☆15May 26, 2026Updated last month
zksha / alma
View on GitHub
ALMA (Automated meta-Learning of Memory designs for Agentic systems) is a framework that meta-learns memory designs to replace human-engi…
☆226Apr 8, 2026Updated 3 months ago
KangsanKim07 / MemoryTransferLearning
View on GitHub
Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents
☆31Apr 16, 2026Updated 3 months ago
lambda-calculus-LLM / lambda-RLM
View on GitHub
Method for Long Context RLMs using verifiable Lambda Calculus
☆303Apr 24, 2026Updated 2 months ago
facebookresearch / HyperAgents
View on GitHub
Self-referential self-improving agents that can optimize for any computable task
☆2,637May 9, 2026Updated 2 months ago
SakanaAI / treequest
View on GitHub
A Tree Search Library with Flexible API for LLM Inference-Time Scaling
☆554Feb 5, 2026Updated 5 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
tue-mps / videomt
View on GitHub
[CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).
☆252Jun 23, 2026Updated 3 weeks ago
imbue-ai / darwinian_evolver
View on GitHub
Framework for evolving code and prompts inspired by Darwinian evolution
☆481Jun 1, 2026Updated last month
seal-rg / recurrent-pretraining
View on GitHub
Pretraining and inference code for a large-scale depth-recurrent language model
☆899Dec 29, 2025Updated 6 months ago
MoonshotAI / Attention-Residuals
View on GitHub
☆3,327Mar 17, 2026Updated 4 months ago
huggingface / ml-intern
View on GitHub
🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models
☆10,654Jul 9, 2026Updated last week
FudanCVL / GlyphPrinter
View on GitHub
[CVPR 2026 Highlight] GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering
☆104Apr 9, 2026Updated 3 months ago
ByteDance-Seed / In-Place-TTT
View on GitHub
☆244Apr 21, 2026Updated 2 months ago