Collection of LLM completions for reasoning-gym task datasets
☆31Jul 4, 2025Updated 9 months ago
Alternatives and similar repositories for reasoning-gym-eval
Users that are interested in reasoning-gym-eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Building the cognitive-core to solve ARC-AGI-2☆27Feb 2, 2025Updated last year
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 7 months ago
- Mixtral-based Ja-En (En-Ja) Translation model☆20Jan 6, 2025Updated last year
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 9 months ago
- ☆15Jun 19, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11Feb 9, 2024Updated 2 years ago
- Statistical test for bias in unsupervised image representations.☆12Mar 8, 2021Updated 5 years ago
- Scaling Laws for Mixture of Experts Models☆15Feb 25, 2025Updated last year
- ☆37May 15, 2025Updated 11 months ago
- https://arxiv.org/abs/2102.12594☆14Oct 3, 2023Updated 2 years ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆27Oct 14, 2025Updated 6 months ago
- Production-Grade Autoresearch. Ideal for GPU kernels, ML model development, feature engineering, prompt engineering, and other optimizabl…☆41Apr 8, 2026Updated last week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,391Mar 28, 2026Updated 2 weeks ago
- Various LLM Benchmarks☆25Feb 20, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Documenting large text datasets 🖼️ 📚☆14Dec 17, 2024Updated last year
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 10 months ago
- The ChatGPT Retrieval Plugin lets you easily search and find personal or work documents by asking questions in everyday language.☆11Apr 22, 2024Updated last year
- ☆32Mar 1, 2024Updated 2 years ago
- SIMBL plugin to work around XCode ≤ 4.1's epic CPU use with iOS 5's WiFi Sync☆13Aug 3, 2011Updated 14 years ago
- Learning Embedding of 3D models with Quadric Loss☆19Dec 8, 2022Updated 3 years ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- A proof-of-concept custom backtester☆22Apr 3, 2024Updated 2 years ago
- Simple in-proc scheduler.☆23Aug 1, 2013Updated 12 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆15Apr 9, 2026Updated last week
- Convert Balsamiq XML export to HTML page☆28Mar 29, 2009Updated 17 years ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 5 months ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆27Mar 9, 2026Updated last month
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 5 months ago
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆30Apr 21, 2025Updated 11 months ago
- Simple embedded java database☆31Jan 16, 2025Updated last year
- Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows☆15Oct 4, 2024Updated last year
- OpenPipe Reinforcement Learning Experiments☆32Mar 14, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [NeurIPS 2025 Spotlight] Fast-Slow Thinking GRPO for Large Vision-Language Model Reasoning☆52Jan 20, 2026Updated 2 months ago
- HELP: a dataset for Handling Entailments with Lexical and logical Phenomena (Ver.1.0)☆15Jul 20, 2023Updated 2 years ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆17Dec 19, 2024Updated last year
- Official implementation of SimFlow☆31Dec 16, 2025Updated 4 months ago
- ☆18Apr 10, 2025Updated last year
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022)☆28Jan 13, 2023Updated 3 years ago