[NeurIPS 25] The official implementation of SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
☆29Sep 21, 2025Updated 9 months ago
Alternatives and similar repositories for SPC
Users who are interested in SPC are comparing it to the libraries listed below.
- Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows☆15Oct 4, 2024Updated last year
- [NeurIPS '25] Multi-Token Prediction Needs Registers☆29Dec 14, 2025Updated 6 months ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 9 months ago
- ☆29Jun 5, 2025Updated last year
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training☆20Mar 4, 2025Updated last year
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents☆37Nov 24, 2025Updated 7 months ago
- [ICML 2025] Generalization Principles for Inference over Text-Attributed Graphs with Large Language☆22Jul 15, 2025Updated 11 months ago
- ☆14May 13, 2025Updated last year
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆20Jun 11, 2024Updated 2 years ago
- Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"☆32Jun 25, 2025Updated last year
- ☆47Apr 9, 2025Updated last year
- ☆12Mar 22, 2025Updated last year
- ☆14May 20, 2025Updated last year
- ☆14Sep 22, 2025Updated 9 months ago
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 4 years ago
- ☆350Jan 29, 2026Updated 5 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆73Apr 22, 2025Updated last year
- This project aims at predicting correlated column pairs in data tables by analyzing column names via large language models.☆11Aug 21, 2023Updated 2 years ago
- Official Repository of NeurIPS2021 paper: PTR☆32Dec 17, 2021Updated 4 years ago
- The survey on diffusion-based graph generative methods☆22Dec 30, 2024Updated last year
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆29Mar 2, 2026Updated 4 months ago
- [CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs☆14Jun 20, 2025Updated last year
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆1,089Apr 15, 2026Updated 2 months ago
- ☆14May 26, 2021Updated 5 years ago
- The code for the Mimic and Rephrase paper☆13Mar 19, 2023Updated 3 years ago
- ☆29Jun 25, 2024Updated 2 years ago
- Official repository for the paper "On Evaluation Metrics for Graph Generative Models"☆25Feb 6, 2022Updated 4 years ago
- ☆12May 19, 2024Updated 2 years ago
- Microsoft Complex Tasks Dataset☆17Jun 12, 2023Updated 3 years ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- ☆15Feb 24, 2021Updated 5 years ago
- An asymmetric 1v1 multiplayer game using Unreal Engine☆18Feb 25, 2017Updated 9 years ago
- ☆16Apr 11, 2026Updated 2 months ago
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion☆13Jul 26, 2023Updated 2 years ago
- Simple python interface to be used with crisp_controllers.☆35Apr 14, 2026Updated 2 months ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 8 months ago
- ☆47Nov 26, 2025Updated 7 months ago
- Awesome paper lists for "A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions"☆34Apr 25, 2025Updated last year
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago