chen-judge/SPC

[NeurIPS 25] The official implementation of SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning

☆30

Alternatives and similar repositories for SPC

Users who are interested in SPC are comparing it to the repositories listed below.

tangzhy / RealCritic
☆15 · Jan 27, 2025 · Updated last year
daeh / computed-appraisals
Computed Appraisals Model. Code and data for the 2023 paper, "Emotion prediction as computation over a generative theory of mind"
☆13 · Jun 12, 2023 · Updated 3 years ago
TamSiuhin / Per-Pcs
Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202…
☆13 · Oct 27, 2024 · Updated last year
GAIR-NLP / LIMOPro
☆15 · May 27, 2025 · Updated last year
rtmaww / O_CILNER
Code for ACL 2023 paper "Learning 'O' Helps for Learning More: Handling the Unlabeled Entity Problem for Class-incremental NER"
☆10 · Jul 17, 2023 · Updated 3 years ago
WangHanLinHenry / SPA-RL-Agent
Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"
☆89 · Sep 13, 2025 · Updated 10 months ago
mansicer / self-verification
☆18 · Dec 23, 2025 · Updated 7 months ago
TIGER-AI-Lab / PixelWorld
The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
☆15 · Sep 12, 2025 · Updated 10 months ago
tanzelin430 / The-Scaling-Law-for-Reinforcement-Learning
[ACL2026] Code Repo for paper "Scaling Behaviors of LLM Reinforcement Learning Post-Training"
☆24 · Jul 1, 2026 · Updated 3 weeks ago
VITA-Group / Data-Efficient-Scaling
[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang
☆14 · Jan 4, 2024 · Updated 2 years ago
boluoweifenda / werewolf
☆28 · May 20, 2024 · Updated 2 years ago
Vinita99 / botnet_detection
Botnet is a form of malware that attacks computers on the internet and controls them with command and control servers to perform a wide v…
☆11 · May 12, 2020 · Updated 6 years ago
jinpz / q_sharp
The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training
☆20 · Mar 4, 2025 · Updated last year
heaplax / ARMAP
☆29 · Jun 5, 2025 · Updated last year
meta-pytorch / popcorn-kernels
For building the world's largest dataset of GPU kernels.
☆11 · Updated this week
IBM / transformers-struct-guidance
Code for the ACL 2021 paper "Structural Guidance for Transformer Language Models"
☆15 · Sep 17, 2025 · Updated 10 months ago
Linear95 / SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
☆145 · Feb 24, 2025 · Updated last year
langmanbusi / InsViE
Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”
☆34 · Apr 3, 2026 · Updated 3 months ago
rhyang2021 / ARIA
Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".
☆30 · Aug 9, 2025 · Updated 11 months ago
MasterVito / SwS
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42 · Nov 11, 2025 · Updated 8 months ago
tehranixyz / CodeRosetta
CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming
☆11 · Nov 18, 2024 · Updated last year
wantbook-book / SeRL
SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
☆24 · Jan 24, 2026 · Updated 6 months ago
Hambaobao / Marathon
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10 · May 16, 2024 · Updated 2 years ago
HauffQian / DGAP
☆14 · May 13, 2025 · Updated last year
sky-bro / extract-ssl-certs-from-pcap
Extract SSL certs from a pcap file, only for TLS v1.2
☆10 · Nov 3, 2020 · Updated 5 years ago
McGill-NLP / feedbackqa
FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback
☆12 · Jul 13, 2022 · Updated 4 years ago
wenjunsun / dlsys-needle-m1
Final project for the class "Deep Learning Systems Algorithms and Implementation" from CMU, where we try to make needle work with Apple M…
☆10 · Jan 8, 2023 · Updated 3 years ago
MetabrainAGI / Awaker2.5-R1
☆12 · Mar 22, 2025 · Updated last year
nasosger / MuToR
[NeurIPS '25] Multi-Token Prediction Needs Registers
☆30 · Dec 14, 2025 · Updated 7 months ago
ShadeCloak / ADORA
☆47 · Apr 9, 2025 · Updated last year
JackKelly / UK-DALE_metadata
Metadata for my UK Domestic Appliance-Level Electricity (UK-DALE) dataset
☆16 · Jul 16, 2017 · Updated 9 years ago
hobart07 / Step1X-Edit_train
☆14 · May 20, 2025 · Updated last year
Akshit21112002 / TTRV
TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)
☆46 · Mar 8, 2026 · Updated 4 months ago
chen-judge / UniGeo
[EMNLP 22] UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression
☆34 · Dec 7, 2022 · Updated 3 years ago
ChicagoHAI / decsum
Implementation for Decision-focused Summarization (EMNLP2021)
☆12 · Mar 14, 2022 · Updated 4 years ago
world-action-verifier / wav_robot
☆17 · Apr 16, 2026 · Updated 3 months ago
iwangjian / Coding-Tutor
[ACL 2025 Findings] Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors
☆90 · Jun 2, 2025 · Updated last year
chtmp223 / suri
Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]
☆27 · Oct 3, 2025 · Updated 9 months ago
IvanLetteri / MTA-KDD-19
☆17 · Mar 16, 2021 · Updated 5 years ago