genlm / genlm-controlLinks
Controlled text generation with programmable constraints
☆159Updated last week
Alternatives and similar repositories for genlm-control
Users that are interested in genlm-control are comparing it to the libraries listed below
Sorting:
- Repository for the paper Stream of Search: Learning to Search in Language☆151Updated 9 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆294Updated this week
- ☆143Updated 2 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆261Updated this week
- Open source interpretability artefacts for R1.☆163Updated 7 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 8 months ago
- Storing long contexts in tiny caches with self-study☆217Updated last month
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆329Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis☆138Updated 7 months ago
- ☆87Updated this week
- ☆111Updated 9 months ago
- Probabilistic programming with large language models☆145Updated last week
- A domain-specific probabilistic programming language for modeling and inference with language models☆137Updated 7 months ago
- ☆150Updated last week
- ☆33Updated 6 months ago
- ☆105Updated 4 months ago
- Implementation of SOAR☆43Updated 2 months ago
- Evaluation of LLMs on latest math competitions☆193Updated last month
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆684Updated last week
- ☆104Updated 10 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 10 months ago
- Jax Codebase for Evolutionary Strategies at the Hyperscale☆40Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆349Updated 5 months ago
- accompanying material for sleep-time compute paper☆117Updated 7 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆273Updated last month
- Source code for the collaborative reasoner research project at Meta FAIR.☆106Updated 7 months ago
- ☆11Updated 7 months ago
- ☆233Updated 5 months ago
- ☆124Updated 9 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆234Updated 4 months ago