probcomp / genlm-control
☆11 Updated 5 months ago
Alternatives and similar repositories for genlm-control
Users who are interested in genlm-control are comparing it to the libraries listed below.
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆20 Updated 6 months ago
- ☆31 Updated 6 months ago
- ☆42 Updated last year
- implementation of dualformer ☆20 Updated 6 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044 ☆35 Updated 11 months ago
- ☆45 Updated last year
- Minimum Description Length probing for neural network representations ☆20 Updated 8 months ago
- ☆85 Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆84 Updated 10 months ago
- ☆19 Updated 6 months ago
- Harmonic Datasets ☆48 Updated last year
- ☆33 Updated 8 months ago
- A repository for research on medium sized language models. ☆78 Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆27 Updated last year
- A testbed for agents and environments that can automatically improve models through data generation. ☆27 Updated 6 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear" ☆80 Updated 10 months ago
- Simple and efficient pytorch-native transformer training and inference (batched) ☆78 Updated last year
- ☆34 Updated last year
- Code for the paper "Function-Space Learning Rates" ☆23 Updated 3 months ago
- The Energy Transformer block, in JAX ☆59 Updated last year
- A domain-specific probabilistic programming language for modeling and inference with language models ☆136 Updated 4 months ago
- This is the official repository for all the code of TheoremLlama ☆45 Updated last month
- Lottery Ticket Adaptation ☆39 Updated 10 months ago
- Controlled text generation with programmable constraints ☆137 Updated last week
- KV Cache Steering for Inducing Reasoning in Small Language Models ☆40 Updated 2 months ago
- ☆47 Updated 5 months ago
- ☆27 Updated last year
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference ☆68 Updated 9 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 Updated 2 years ago