[NeurIPS 25] The official implementation of SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
☆29 Sep 21, 2025 Updated 8 months ago
Alternatives and similar repositories for SPC
Users that are interested in SPC are comparing it to the libraries listed below.
- Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202… ☆13 Oct 27, 2024 Updated last year
- Computed Appraisals Model. Code and data for the 2023 paper, "Emotion prediction as computation over a generative theory of mind" ☆13 Jun 12, 2023 Updated 3 years ago
- Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows ☆15 Oct 4, 2024 Updated last year
- [NeurIPS '25] Multi-Token Prediction Needs Registers ☆29 Dec 14, 2025 Updated 6 months ago
- ☆29 Jun 5, 2025 Updated last year
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training ☆20 Mar 4, 2025 Updated last year
- ☆83 May 2, 2026 Updated last month
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback ☆12 Jul 13, 2022 Updated 3 years ago
- ☆47 Apr 9, 2025 Updated last year
- ☆12 Mar 22, 2025 Updated last year
- EMNLP 2021: A Label-Aware BERT Attention Network for Zero-Shot Multi-Intent Detection in Spoken Language Understanding ☆10 Apr 8, 2022 Updated 4 years ago
- ☆14 Sep 22, 2025 Updated 8 months ago
- Implementation for Decision-focused Summarization (EMNLP2021) ☆12 Mar 14, 2022 Updated 4 years ago
- ☆346 Jan 29, 2026 Updated 4 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning" ☆73 Apr 22, 2025 Updated last year
- This project aims at predicting correlated column pairs in data tables by analyzing column names via large language models. ☆11 Aug 21, 2023 Updated 2 years ago
- ☆13 Apr 30, 2020 Updated 6 years ago
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning" ☆28 Mar 2, 2026 Updated 3 months ago
- [CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs ☆14 Jun 20, 2025 Updated 11 months ago
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning ☆1,083 Apr 15, 2026 Updated last month
- A collection of important papers on Generalizable Diffusion-generated Image Detection ☆22 Mar 20, 2025 Updated last year
- The code for the Mimic and Rephrase paper ☆13 Mar 19, 2023 Updated 3 years ago
- ☆29 Jun 25, 2024 Updated last year
- Microsoft Complex Tasks Dataset ☆17 Jun 12, 2023 Updated 3 years ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆49 Feb 4, 2026 Updated 4 months ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts" ☆11 Nov 18, 2022 Updated 3 years ago
- ☆15 Feb 24, 2021 Updated 5 years ago
- An asymmetric 1v1 multiplayer game using Unreal Engine ☆18 Feb 25, 2017 Updated 9 years ago
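- Code from the team that ranked 6th in the preliminary round and 11th in the semifinal round of the 2020 Tencent Advertising Algorithm Competition (wujie's code) ☆12 Apr 12, 2021 Updated 5 years ago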
- ☆16 Apr 11, 2026 Updated 2 months ago
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion ☆13 Jul 26, 2023 Updated 2 years ago
- ☆38 Sep 11, 2025 Updated 9 months ago
- ☆34 Apr 23, 2026 Updated last month
- Simple python interface to be used with crisp_controllers. ☆35 Apr 14, 2026 Updated 2 months ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni… ☆15 Oct 31, 2025 Updated 7 months ago
- Awesome paper lists for "A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions" ☆34 Apr 25, 2025 Updated last year
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025 ☆35 Feb 22, 2026 Updated 3 months ago
- The official implementation of InfoRM [NeurIPS 2024]. ☆15 Oct 25, 2025 Updated 7 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models ☆73 Feb 25, 2025 Updated last year