google-deepmind / regress-lmLinks
Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple regression tasks.
☆308Updated last month
Alternatives and similar repositories for regress-lm
Users that are interested in regress-lm are comparing it to the libraries listed below
Sorting:
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆285Updated last month
- ☆238Updated last month
- Simple & Scalable Pretraining for Neural Architecture Research☆306Updated last month
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆787Updated last week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆190Updated 10 months ago
- RLP: Reinforcement as a Pretraining Objective☆223Updated 3 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆345Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆259Updated this week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆358Updated 6 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆124Updated 2 months ago
- Training API and CLI☆311Updated last month
- ☆237Updated 2 weeks ago
- Open-source release accompanying Gao et al. 2025☆490Updated last month
- Curated collection of community environments☆204Updated last week
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆141Updated 5 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 4 months ago
- Code for ExploreTom☆89Updated 6 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆370Updated last year
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆346Updated 3 weeks ago
- The official repository of ALE-Bench☆149Updated 2 weeks ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆126Updated 3 months ago
- Storing long contexts in tiny caches with self-study☆231Updated last month
- ☆166Updated 5 months ago
- Open source interpretability artefacts for R1.☆167Updated 9 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- AIRA-dojo: a framework for developing and evaluating AI research agents☆124Updated 2 months ago
- Official JAX implementation of End-to-End Test-Time Training for Long Context☆445Updated this week
- An interface library for RL post training with environments.☆1,066Updated this week
- ☆151Updated 4 months ago
- Implementation of SOAR☆48Updated 4 months ago