google-deepmind / regress-lm
Library for text-to-text regression that is applicable to any input string representation and allows pretraining and fine-tuning over multiple regression tasks.
☆295Updated last week
Alternatives and similar repositories for regress-lm
Users who are interested in regress-lm are comparing it to the libraries listed below.
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆272Updated 2 weeks ago
- Simple & Scalable Pretraining for Neural Architecture Research☆304Updated last month
- RLP: Reinforcement as a Pretraining Objective☆205Updated 2 months ago
- ☆234Updated 5 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆705Updated last week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆239Updated this week
- Super basic implementation (gist-like) of RLMs with REPL environments.☆278Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 9 months ago
- Training API and CLI☆248Updated last week
- Source code for the collaborative reasoner research project at Meta FAIR.☆111Updated 7 months ago
- ☆231Updated last week
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆330Updated last year
- An interface library for RL post training with environments.☆829Updated this week
- Dion optimizer algorithm☆403Updated this week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆359Updated last year
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆136Updated 3 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆353Updated 5 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆118Updated 3 weeks ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆167Updated 3 months ago
- Training-Ready RL Environments + Evals☆185Updated last week
- Storing long contexts in tiny caches with self-study☆218Updated this week
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- All information and news with respect to Falcon-H1 series☆93Updated 2 months ago
- ☆107Updated last week
- Open source interpretability artefacts for R1.☆164Updated 7 months ago
- ☆161Updated 3 months ago
- Code for ExploreTom☆88Updated 5 months ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆391Updated 3 weeks ago
- rl from zero pretrain, can it be done? yes.☆282Updated 2 months ago