THU-KEG/VerIF

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/THU-KEG/VerIF)

THU-KEG / VerIF

[EMNLP 2025] Verification Engineering for RL in Instruction Following

☆57

Alternatives and similar repositories for VerIF

Users that are interested in VerIF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

THU-KEG / Agentic-Reward-Modeling
View on GitHub
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆134Jun 11, 2025Updated last year
yuleiqin / RAIF
View on GitHub
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆32Oct 9, 2025Updated 9 months ago
Rainier-rq / verl-if
View on GitHub
Official implementation of the paper "Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following"
☆40Jan 11, 2026Updated 6 months ago
kkk-an / UltraIF
View on GitHub
Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.
☆21Apr 3, 2025Updated last year
Junjie-Ye / MulDimIF
View on GitHub
[ACL 2026] A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models
☆23Jul 10, 2026Updated 2 weeks ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
allenai / IFBench
View on GitHub
☆160May 13, 2026Updated 2 months ago
Qihoo360 / Light-IF
View on GitHub
☆39Nov 20, 2025Updated 8 months ago
Tongyi-CCAI / Complex-IF
View on GitHub
☆34Jan 26, 2026Updated 5 months ago
THU-KEG / AtomR
View on GitHub
[KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning
☆15May 27, 2025Updated last year
THU-KEG / WildReward
View on GitHub
Code for paper "WildReward: Learning Reward Models from In-the-Wild Human Interactions"
☆23Feb 26, 2026Updated 4 months ago
microsoft / MetaST
View on GitHub
☆26Jul 25, 2023Updated 2 years ago
THU-KEG / RM-Bench
View on GitHub
[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
☆84Jul 18, 2025Updated last year
PKU-Baichuan-MLSystemLab / CFBench
View on GitHub
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs
☆55Aug 26, 2024Updated last year
meowpass / FollowComplexInstruction
View on GitHub
Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…
☆55Jun 24, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
THU-KEG / AgentIF
View on GitHub
[NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
☆39Dec 1, 2025Updated 7 months ago
meituan-longcat / Meeseeks
View on GitHub
A iterative feedback driven benchmark on LLM's instruction following ability
☆58May 25, 2026Updated last month
THU-KEG / LRM-FactEval
View on GitHub
☆17Jun 25, 2025Updated last year
MiniMax-AI / mini-vela
View on GitHub
☆37Apr 2, 2026Updated 3 months ago
MetaStone-AI / MetaStone-S1
View on GitHub
The open-source code of MetaStone-S1.
☆106Aug 1, 2025Updated 11 months ago
wzhouad / WPO
View on GitHub
Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"
☆41Sep 24, 2024Updated last year
BryceZhuo / HybridNorm
View on GitHub
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19Mar 7, 2025Updated last year
RUCAIBox / OlymMATH
View on GitHub
The OlymMATH dataset
☆24Jun 1, 2025Updated last year
open-compass / RePro
View on GitHub
[ICLR 2026] Rectifying LLM Thought From Lens of Optimization
☆15Dec 5, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
MozerWang / DEMO
View on GitHub
[ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
☆22Dec 16, 2024Updated last year
PRIME-RL / Entropy-Mechanism-of-RL
View on GitHub
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆444Jul 11, 2025Updated last year
OpenBMB / RLPR
View on GitHub
Extrapolating RLVR to General Domains without Verifiers
☆205Aug 12, 2025Updated 11 months ago
mandyyyyii / east
View on GitHub
☆19Aug 4, 2025Updated 11 months ago
MingLiiii / Gradient_Unified
View on GitHub
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
☆20Jun 17, 2025Updated last year
LIFEBench / LIFEBench
View on GitHub
LIFEBENCH: Evaluating Length Instruction Following in Large Language Models
☆18Apr 23, 2026Updated 3 months ago
thu-coai / ComplexBench
View on GitHub
Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)
☆102Feb 20, 2025Updated last year
PKU-ML / LongPPL
View on GitHub
Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"
☆115Oct 11, 2025Updated 9 months ago
wumingqi / LLM-Math-Evaluation
View on GitHub
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.
☆21Jul 18, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
THU-KEG / ADELIE
View on GitHub
[EMNLP2024] Aligning Large Language Models on Information Extraction
☆56Nov 4, 2024Updated last year
ZhangXJ199 / EDGE-GRPO
View on GitHub
Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity
☆22Aug 28, 2025Updated 10 months ago
TingchenFu / MathIF
View on GitHub
instruction-following benchmark for large reasoning models
☆49Apr 19, 2026Updated 3 months ago
PRIME-RL / RL-Compositionality
View on GitHub
FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
☆68Jan 26, 2026Updated 5 months ago
zijian678 / TDD
View on GitHub
☆14Apr 22, 2024Updated 2 years ago
yichengchen24 / MIG
View on GitHub
[ACL2025 Findings] Official code for MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Spac…
☆28Aug 30, 2025Updated 10 months ago
MasterVito / SwS
View on GitHub
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42Nov 11, 2025Updated 8 months ago