SakanaAI / text-to-loraLinks
Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
☆798Updated last month
Alternatives and similar repositories for text-to-lora
Users that are interested in text-to-lora are comparing it to the libraries listed below
Sorting:
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆314Updated 8 months ago
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆1,108Updated this week
- Self-Adapting Language Models☆672Updated 3 weeks ago
- ☆156Updated 2 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆298Updated this week
- 🤗 Benchmark Large Language Models Reliably On Your Data☆354Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆295Updated 2 weeks ago
- Build your own visual reasoning model☆395Updated this week
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆640Updated 3 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,123Updated 5 months ago
- Atom of Thoughts for Markov LLM Test-Time Scaling☆577Updated 3 weeks ago
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆345Updated 6 months ago
- Code release for "LLMs can see and hear without any training"☆447Updated 2 months ago
- Pretraining code for a large-scale depth-recurrent language model☆793Updated last month
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆535Updated this week
- ☆214Updated 4 months ago
- Code implementation for paper "A-mem: Agentic Memory for LLM Agents"☆477Updated last month
- Build datasets using natural language☆498Updated 2 months ago
- Dream 7B, a large diffusion language model☆816Updated 3 weeks ago
- procedural reasoning datasets☆938Updated this week
- GRadient-INformed MoE☆263Updated 9 months ago
- ☆162Updated 2 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆273Updated 2 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆341Updated 7 months ago
- Exploring Applications of GRPO☆240Updated this week
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆529Updated 2 weeks ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆223Updated last week
- Synthetic data curation for post-training and structured data extraction☆1,434Updated this week
- An agent benchmark with tasks in a simulated software company.☆468Updated 2 weeks ago
- Live-bending a foundation model’s output at neural network level.☆262Updated 3 months ago