SakanaAI / text-to-lora
Hypernetworks that adapt LLMs for specific benchmark tasks using only a textual task description as input
☆649 Updated 2 weeks ago
Alternatives and similar repositories for text-to-lora
Users who are interested in text-to-lora are comparing it to the repositories listed below.
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆311 Updated 8 months ago
- Live-bending a foundation model’s output at neural network level. ☆258 Updated 2 months ago
- ☆149 Updated 2 months ago
- Build your own visual reasoning model ☆381 Updated this week
- Agent Reinforcement Trainer for training multi-turn agents using GRPO ☆730 Updated this week
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification. ☆632 Updated 3 months ago
- Dream 7B, a large diffusion language model ☆764 Updated last week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse … ☆498 Updated this week
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions" ☆343 Updated 6 months ago
- Pretraining code for a large-scale depth-recurrent language model ☆782 Updated last week
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆329 Updated this week
- Synthetic data curation for post-training and structured data extraction ☆1,404 Updated this week
- Code release for "LLMs can see and hear without any training" ☆443 Updated last month
- Self-Adapting Language Models ☆430 Updated this week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆545 Updated 3 months ago
- Build datasets using natural language ☆492 Updated last month
- LIMO: Less is More for Reasoning ☆960 Updated 2 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,100 Updated 4 months ago
- PyTorch implementation of models from the Zamba2 series. ☆182 Updated 4 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective ☆988 Updated 3 weeks ago
- CodeScientist: An automated scientific discovery system for code-based experiments ☆271 Updated 2 months ago
- ☆157 Updated last month
- Tina: Tiny Reasoning Models via LoRA ☆258 Updated 3 weeks ago
- II-Researcher: a new open-source framework designed to aid building search / research agents ☆373 Updated last month
- Code implementation for paper "A-mem: Agentic Memory for LLM Agents" ☆456 Updated 3 weeks ago
- Exploring Applications of GRPO ☆230 Updated last month
- ☆331 Updated last week
- GRadient-INformed MoE ☆263 Updated 8 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆337 Updated 6 months ago
- See Through Your Models ☆393 Updated 3 months ago