SakanaAI / text-to-loraLinks
Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
☆928Updated 6 months ago
Alternatives and similar repositories for text-to-lora
Users that are interested in text-to-lora are comparing it to the libraries listed below
Sorting:
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆343Updated last year
- Build your own visual reasoning model☆415Updated last month
- dLLM: Simple Diffusion Language Modeling☆1,504Updated this week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆353Updated 6 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆504Updated 2 weeks ago
- An interface library for RL post training with environments.☆848Updated last week
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆351Updated last year
- 🤗 Benchmark Large Language Models Reliably On Your Data☆419Updated last week
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,152Updated last week
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆710Updated this week
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆620Updated last month
- Code release for "LLMs can see and hear without any training"☆454Updated 7 months ago
- ☆1,233Updated last month
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,177Updated 10 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆780Updated this week
- Pretraining and inference code for a large-scale depth-recurrent language model☆856Updated 2 months ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆312Updated 4 months ago
- Dream 7B, a large diffusion language model☆1,115Updated last month
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆485Updated 3 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆451Updated 3 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆286Updated 2 months ago
- Async RL Training at Scale☆950Updated this week
- ☆185Updated 3 weeks ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆676Updated 9 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆581Updated 4 months ago
- The code for NeurIPS 2025 paper "A-MEM: Agentic Memory for LLM Agents"☆731Updated last month
- Open-source release accompanying Gao et al. 2025☆450Updated last week
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆304Updated 3 weeks ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,283Updated last week