Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044
☆36Oct 3, 2024Updated last year
Alternatives and similar repositories for RATIONALYST
Users that are interested in RATIONALYST are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of "GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution"☆20Apr 3, 2024Updated 2 years ago
- ☆14Nov 15, 2022Updated 3 years ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated 2 years ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]☆27Oct 3, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆89Aug 10, 2024Updated last year
- ☆25Apr 3, 2025Updated last year
- Using a reasoning LLM to learn a prompt from data☆25May 5, 2025Updated last year
- Code for the paper Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents: https://arxiv.org/abs/2205.12…☆12Feb 10, 2024Updated 2 years ago
- ☆52Mar 9, 2026Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Jun 26, 2026Updated last week
- ☆29Mar 13, 2026Updated 3 months ago
- This repo explores how AMR to address tasks difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- The Lean Theorem Proving Environment☆15May 7, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Aug 10, 2021Updated 4 years ago
- ☆25Jan 17, 2025Updated last year
- ☆15Mar 20, 2025Updated last year
- ☆11Jun 2, 2022Updated 4 years ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆240Jul 19, 2025Updated 11 months ago
- ☆11Mar 25, 2022Updated 4 years ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"☆63Dec 26, 2025Updated 6 months ago
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆42Apr 10, 2026Updated 2 months ago
- Toolkit for Seamlessly Enabling RL Training on Any Agent with Bedrock AgentCore.☆46Jun 22, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- TensorFlow Quantization Example, for TensorFlow Lite☆18Aug 4, 2019Updated 6 years ago
- ☆17Mar 14, 2024Updated 2 years ago
- Wenzhou-Kean University AI-LAB☆10Jun 6, 2022Updated 4 years ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- ☆21Apr 3, 2026Updated 3 months ago
- ☆14Jun 9, 2017Updated 9 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆16Aug 11, 2023Updated 2 years ago
- A package containing utils for the PyTorch version of the Tapas algorithm.☆11Apr 29, 2021Updated 5 years ago
- ☆10Sep 14, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆20Oct 25, 2022Updated 3 years ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆43Sep 18, 2025Updated 9 months ago
- [ACL2026 oral] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆25Apr 13, 2026Updated 2 months ago
- Created an inverted index in Python for document retreival☆13Oct 7, 2018Updated 7 years ago
- ☆14Jan 10, 2024Updated 2 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- ☆23Dec 17, 2024Updated last year