Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044
☆35Oct 3, 2024Updated last year
Alternatives and similar repositories for RATIONALYST
Users that are interested in RATIONALYST are comparing it to the libraries listed below
Sorting:
- ☆14Mar 20, 2025Updated 11 months ago
- For ACL25 paper "WAFFLE: Multi-Modal Model for Automated Front-End Development" - by Shanchao Liang and Nan Jiang and Shangshu Qian and L…☆11May 28, 2025Updated 9 months ago
- Using a reasoning LLM to learn a prompt from data☆25May 5, 2025Updated 9 months ago
- ☆12Nov 15, 2022Updated 3 years ago
- ☆11Aug 10, 2021Updated 4 years ago
- ☆19Jun 4, 2025Updated 9 months ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- ☆27Jun 5, 2025Updated 9 months ago
- Official Implementation of "GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution"☆20Apr 3, 2024Updated last year
- ☆12Apr 17, 2025Updated 10 months ago
- ☆16Mar 14, 2024Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆18Oct 17, 2025Updated 4 months ago
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆42Jul 7, 2025Updated 7 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 6 months ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"☆57Dec 26, 2025Updated 2 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆234Jul 19, 2025Updated 7 months ago
- Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025☆45May 23, 2025Updated 9 months ago
- ☆21May 3, 2025Updated 10 months ago
- ☆23Jan 17, 2025Updated last year
- ☆20Dec 14, 2024Updated last year
- A State-Space Model with Rational Transfer Function Representation.☆83May 17, 2024Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆86Aug 10, 2024Updated last year
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- ☆24Apr 3, 2025Updated 11 months ago
- ☆124Jul 23, 2025Updated 7 months ago
- Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges☆28May 14, 2025Updated 9 months ago
- ☆17Aug 1, 2025Updated 7 months ago
- ☆20Oct 25, 2022Updated 3 years ago
- ☆130Oct 1, 2024Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Mar 6, 2024Updated last year
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 5 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆59Mar 17, 2025Updated 11 months ago
- ☆105Mar 25, 2025Updated 11 months ago