SakanaAI / text-to-lora
Hypernetworks that adapt LLMs for specific benchmark tasks using only a textual task description as input
☆936 Updated 7 months ago
Alternatives and similar repositories for text-to-lora
Users who are interested in text-to-lora are comparing it to the libraries listed below.
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆346 Updated last year
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆423 Updated 2 weeks ago
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆390 Updated last week
- An interface library for RL post training with environments. ☆1,004 Updated this week
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions" ☆352 Updated last year
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling ☆512 Updated last month
- dLLM: Simple Diffusion Language Modeling ☆1,566 Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆357 Updated 6 months ago
- ☆158 Updated 8 months ago
- Build your own visual reasoning model ☆416 Updated last month
- General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes. ☆876 Updated this week
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification. ☆679 Updated 9 months ago
- ☆182 Updated last month
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling ☆631 Updated last month
- OpenTinker is an RL-as-a-Service infrastructure for foundation models ☆547 Updated this week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,179 Updated 11 months ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004) ☆725 Updated 3 weeks ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution ☆773 Updated last week
- Code release for "LLMs can see and hear without any training" ☆458 Updated 8 months ago
- Dream 7B, a large diffusion language model ☆1,139 Updated last month
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆494 Updated 4 months ago
- The code for NeurIPS 2025 paper "A-Mem: Agentic Memory for LLM Agents" ☆747 Updated 2 weeks ago
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗 ☆1,209 Updated this week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆370 Updated last year
- ☆252 Updated 10 months ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T… ☆323 Updated 4 months ago
- A character-level language diffusion model trained on Tiny Shakespeare ☆824 Updated 2 weeks ago
- Build datasets using natural language ☆558 Updated 3 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆459 Updated 4 months ago
- Simple & Scalable Pretraining for Neural Architecture Research ☆306 Updated last month