SakanaAI / text-to-loraLinks
Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
☆913Updated 5 months ago
Alternatives and similar repositories for text-to-lora
Users that are interested in text-to-lora are comparing it to the libraries listed below
Sorting:
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆327Updated last year
- Build your own visual reasoning model☆414Updated last month
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆484Updated this week
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆596Updated 4 months ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆667Updated 2 weeks ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆411Updated last month
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆348Updated 4 months ago
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆348Updated 10 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,162Updated 9 months ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆668Updated 7 months ago
- An interface library for RL post training with environments.☆687Updated this week
- ☆1,073Updated 2 weeks ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆473Updated 2 months ago
- Code release for "LLMs can see and hear without any training"☆452Updated 6 months ago
- ☆700Updated last month
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,078Updated last week
- The code for NeurIPS 2025 paper "A-MEM: Agentic Memory for LLM Agents"☆667Updated last week
- ☆179Updated 3 months ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆291Updated 2 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model☆843Updated 3 weeks ago
- Build datasets using natural language☆543Updated last month
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆355Updated 11 months ago
- ☆158Updated 6 months ago
- Dream 7B, a large diffusion language model☆1,054Updated last month
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆743Updated this week
- ☆235Updated 8 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆568Updated 3 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆447Updated 2 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆630Updated last week
- Self-Adapting Language Models☆1,487Updated 3 months ago