icip-cas/Verifier-Engineering

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/icip-cas/Verifier-Engineering)

icip-cas / Verifier-Engineering

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

☆63

Alternatives and similar repositories for Verifier-Engineering

Users that are interested in Verifier-Engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

icip-cas / LiteCoder
View on GitHub
Advancing Small and Medium-sized Code Agents.
☆17May 29, 2026Updated last month
c-box / causalEval
View on GitHub
Code for ACL 2022 long paper: Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View
☆10May 17, 2022Updated 4 years ago
icip-cas / awesome-auto-alignment
View on GitHub
Collection of papers for scalable automated alignment.
☆92Oct 22, 2024Updated last year
panruotong / CAG
View on GitHub
Implementation of Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation. Paper: https://arxiv.org/abs/2404.06809
☆22Oct 22, 2024Updated last year
c-box / KnowledgeLifecycle
View on GitHub
Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"
☆58Aug 24, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Edward-Sun / easy-to-hard
View on GitHub
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
☆124Sep 9, 2024Updated last year
linjh1118 / survey_agent
View on GitHub
☆17Jan 14, 2026Updated 6 months ago
chenjiawei30 / ConsistentChat
View on GitHub
Code for "ConsistentChat: Building Skeleton-Guided Consistent Multi-Turn Dialogues for Large Language Models from Scratch", where dataset…
☆16Sep 8, 2025Updated 10 months ago
alexzhou907 / dialogue_evaluation
View on GitHub
☆22Dec 8, 2022Updated 3 years ago
zhaochen0110 / Cotempqa
View on GitHub
Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)
☆31Jul 3, 2024Updated 2 years ago
sanmusunrise / AdaScaling
View on GitHub
Adaptive Scaling for Sparse Detection in Information Extraction
☆31Jun 12, 2018Updated 8 years ago
teorth / newton
View on GitHub
☆12Dec 25, 2023Updated 2 years ago
dkopi / Bitune
View on GitHub
Implementation of Bitune: Bidirectional Instruction-Tuning
☆27Jun 19, 2025Updated last year
CyberAgentAILab / regularized-bon
View on GitHub
Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).
☆14Apr 4, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ernie-research / Tool-Augmented-Reward-Model
View on GitHub
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆54Jun 6, 2025Updated last year
r-three / realistic_evaluation_of_model_merging_for_compositional_generalization
View on GitHub
☆13Feb 11, 2026Updated 5 months ago
euclid-multimodal / Euclid
View on GitHub
☆18Jan 9, 2025Updated last year
Zhou-Zoey / RMB-Reward-Model-Benchmark
View on GitHub
☆48Mar 25, 2025Updated last year
njucckevin / MM-Self-Improve
View on GitHub
A Self-Training Framework for Vision-Language Reasoning
☆90Jan 23, 2025Updated last year
zjunlp / KnowSelf
View on GitHub
[ACL 2025] Agentic Knowledgeable Self-awareness
☆93Jun 15, 2025Updated last year
zzhang0179 / Unveiling-Linguistic-Regions-in-LLMs
View on GitHub
[ACL 2024] Unveiling Linguistic Regions in Large Language Models
☆34Jun 9, 2024Updated 2 years ago
scaleapi / SWE-Interact
View on GitHub
New testbed of interactive SWE tasks for coding agents, set in a realistic multi-turn developer driven environment
☆24Jun 30, 2026Updated 3 weeks ago
Improbable-AI / curiosity_redteam
View on GitHub
Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX…
☆90Mar 15, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
multimodal-art-projection / CodeCriticBench
View on GitHub
☆16Nov 1, 2025Updated 8 months ago
icip-cas / ChatAlpaca
View on GitHub
A Multi-Turn Dialogue Corpus based on Alpaca Instructions
☆176Jun 1, 2023Updated 3 years ago
Lagooon / LeanSTaR
View on GitHub
☆44Sep 19, 2024Updated last year
TianHongZXY / CoRe
View on GitHub
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)
☆51Dec 15, 2023Updated 2 years ago
lingo-mit / lm-truthfulness
View on GitHub
☆17Dec 21, 2023Updated 2 years ago
gl-ybnbxb / BoNBoN
View on GitHub
☆19Jun 3, 2024Updated 2 years ago
crushr / EANN_Implemetation
View on GitHub
EANN(Pytorch)
☆10Mar 12, 2022Updated 4 years ago
ltzheng / SimpleTIR
View on GitHub
[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆401Mar 30, 2026Updated 3 months ago
weixuan-wang123 / ReMaKE
View on GitHub
☆14Sep 1, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
declare-lab / ferret
View on GitHub
Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique
☆19Aug 22, 2024Updated last year
dannyallover / overthinking_the_truth
View on GitHub
☆29Apr 30, 2024Updated 2 years ago
zhaochen0110 / conflictbank
View on GitHub
Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…
☆71May 16, 2025Updated last year
reds-lab / BEEAR
View on GitHub
This is the official Gtihub repo for our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Lang…
☆23Jul 3, 2024Updated 2 years ago
icip-cas / AutoAlign
View on GitHub
A toolkit for automated alignment research.
☆15Jul 3, 2026Updated 3 weeks ago
jszheng21 / RACE
View on GitHub
RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.
☆14Oct 12, 2024Updated last year
GuanghaoYe / Emergence-of-Thinking
View on GitHub
☆55Feb 11, 2025Updated last year