open-thought / reasoning-gym-eval
Collection of LLM completions for reasoning-gym task datasets
☆19 Updated this week
Alternatives and similar repositories for reasoning-gym-eval:
Users who are interested in reasoning-gym-eval are comparing it to the repositories listed below:
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆39 Updated last week
- ☆38 Updated 8 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆62 Updated 8 months ago
- ☆48 Updated last year
- A reading list of relevant papers and projects on foundation model annotation ☆25 Updated last month
- Simple GRPO scripts and configurations. ☆58 Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆39 Updated 2 months ago
- ☆48 Updated 5 months ago
- look how they massacred my boy ☆63 Updated 6 months ago
- Accompanying material for the sleep-time compute paper ☆17 Updated this week
- ☆41 Updated last week
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience" ☆60 Updated 2 weeks ago
- ☆51 Updated this week
- OpenPipe Reinforcement Learning Experiments ☆22 Updated last month
- Synthetic data derived by templating, few-shot prompting, transformations on public domain corpora, and Monte Carlo tree search. ☆32 Updated last month
- Latent Large Language Models ☆17 Updated 8 months ago
- ☆48 Updated last year
- EvaByte: Efficient Byte-level Language Models at Scale ☆87 Updated last month
- microjax: a micro, JAX-like function transformation engine ☆30 Updated 5 months ago
- Implementation of Spectral State Space Models ☆16 Updated last year
- ☆128 Updated 3 weeks ago
- ☆71 Updated this week
- An open-source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆96 Updated last month
- ☆22 Updated last year
- ☆20 Updated 5 months ago
- ☆27 Updated 9 months ago
- gzip Predicts Data-dependent Scaling Laws ☆34 Updated 10 months ago
- ☆17 Updated 3 months ago
- Very minimal (and stateless) agent framework ☆42 Updated 3 months ago
- ☆17 Updated 6 months ago