Collection of LLM completions for reasoning-gym task datasets
☆30Jul 4, 2025Updated 8 months ago
Alternatives and similar repositories for reasoning-gym-eval
Users that are interested in reasoning-gym-eval are comparing it to the libraries listed below
Sorting:
- ☆21Jul 9, 2025Updated 7 months ago
- Building the cognitive-core to solve ARC-AGI-2☆27Feb 2, 2025Updated last year
- Various LLM Benchmarks☆24Feb 20, 2026Updated 2 weeks ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆27Oct 14, 2025Updated 4 months ago
- Machine Learning from Human Preferences☆28Feb 13, 2026Updated 3 weeks ago
- Mixtral-based Ja-En (En-Ja) Translation model☆20Jan 6, 2025Updated last year
- OpenPipe Reinforcement Learning Experiments☆32Mar 14, 2025Updated 11 months ago
- ☆37May 15, 2025Updated 9 months ago
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆29Apr 21, 2025Updated 10 months ago
- [NeurIPS 2025 Spotlight] Fast-Slow Thinking GRPO for Large Vision-Language Model Reasoning☆49Jan 20, 2026Updated last month
- Codes and datasets for adaptive spline fitting method SHAPES☆10Sep 27, 2024Updated last year
- Implementations of Curious Replay for model-based adaptation.☆43Jul 5, 2023Updated 2 years ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,352Jan 16, 2026Updated last month
- Gravitational wave interferometer parameter optimisation game, written in Python and run in a Jupyter notebook.☆10Dec 18, 2018Updated 7 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- ☆16Feb 22, 2025Updated last year
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- A Sensor Streamer for Android Wear OS☆14Feb 9, 2024Updated 2 years ago
- ☆14Mar 21, 2024Updated last year
- Gestro revolutionises the way of controlling one’s PC in real-time with the use of hand gestures, made possible using Computer Vision app…☆13Apr 21, 2022Updated 3 years ago
- LLM Skirmish☆44Feb 3, 2026Updated last month
- code for polite☆11Feb 28, 2024Updated 2 years ago
- Mirror of pyseobnr repository from LIGO☆10Feb 22, 2026Updated last week
- Library of Octave functions for continuous gravitational-wave data analysis☆12Feb 28, 2023Updated 3 years ago
- About Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"☆13Oct 7, 2023Updated 2 years ago
- Codes to compute the WDM wavelet transform☆11Oct 20, 2020Updated 5 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- Trans-dimensional Bayesian sampler for gravitational-wave data analysis in pulsar timing array data☆11Dec 3, 2021Updated 4 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Feb 23, 2026Updated last week
- Source code repository for the AISTAT 2023 paper Transport Reversible Jump Proposals.☆10Mar 3, 2023Updated 3 years ago
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- Train your own SOTA deductive reasoning model☆107Mar 6, 2025Updated last year
- ☆74Jun 28, 2025Updated 8 months ago
- ☆53Feb 19, 2025Updated last year
- ☆39Aug 9, 2022Updated 3 years ago
- Isaac Gym Reinforcement Learning Environments for humanoid robot Bez☆10Jul 27, 2022Updated 3 years ago
- Spectral siren cosmology with the population of compact binary coalescences☆11Jul 9, 2024Updated last year