SakanaAI / text-to-loraLinks
Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
☆839Updated 2 months ago
Alternatives and similar repositories for text-to-lora
Users that are interested in text-to-lora are comparing it to the libraries listed below
Sorting:
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆318Updated 10 months ago
- Build your own visual reasoning model☆405Updated this week
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆444Updated 3 weeks ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆385Updated 2 weeks ago
- ☆155Updated 4 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆328Updated 2 months ago
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆348Updated 8 months ago
- Atom of Thoughts for Markov LLM Test-Time Scaling☆583Updated 2 months ago
- Self-Adapting Language Models☆758Updated 3 weeks ago
- Code release for "LLMs can see and hear without any training"☆451Updated 3 months ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆658Updated 5 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆266Updated last month
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆501Updated last week
- ☆173Updated 2 weeks ago
- A-MEM: Agentic Memory for LLM Agents☆517Updated 3 weeks ago
- Dream 7B, a large diffusion language model☆915Updated this week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,136Updated 6 months ago
- A lightweight, local-first, and free experiment tracking Python library built on top of 🤗 Datasets and Spaces.☆647Updated this week
- CodeScientist: An automated scientific discovery system for code-based experiments☆289Updated last month
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆387Updated last week
- Build datasets using natural language☆515Updated 3 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆586Updated last week
- Simple & Scalable Pretraining for Neural Architecture Research☆287Updated 2 weeks ago
- ☆220Updated 5 months ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆230Updated 2 weeks ago
- ☆401Updated this week
- Live-bending a foundation model’s output at neural network level.☆266Updated 4 months ago
- ☆228Updated last month
- Official repository for "NoLiMa: Long-Context Evaluation Beyond Literal Matching"☆140Updated last month
- Official python implementation of the UTCP☆440Updated this week