Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
☆63Dec 5, 2024Updated last year
Alternatives and similar repositories for Verifier-Engineering
Users that are interested in Verifier-Engineering are comparing it to the libraries listed below
Sorting:
- Introduction about AWESOME_ENTROPY+LRM_PAPERS☆30Dec 16, 2025Updated 2 months ago
- Collection of papers for scalable automated alignment.☆93Oct 22, 2024Updated last year
- WisdoMentor - Series: A LLM for undergraduates | 博导智言(辅助大学生 学习)☆13May 9, 2024Updated last year
- Code for ACL 2022 long paper: Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View☆10May 17, 2022Updated 3 years ago
- ☆12Feb 11, 2026Updated 3 weeks ago
- ☆12Dec 25, 2023Updated 2 years ago
- Generative Visual Code Mobile World Model☆40Feb 4, 2026Updated 3 weeks ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Sep 9, 2024Updated last year
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated 11 months ago
- ☆17Jan 9, 2025Updated last year
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆59Aug 24, 2023Updated 2 years ago
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Apr 24, 2024Updated last year
- Implementation of Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation. Paper: https://arxiv.org/abs/2404.06809☆22Oct 22, 2024Updated last year
- Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique☆18Aug 22, 2024Updated last year
- ☆43Sep 19, 2024Updated last year
- ☆47Mar 25, 2025Updated 11 months ago
- Implementation of Bitune: Bidirectional Instruction-Tuning☆24Jun 19, 2025Updated 8 months ago
- ☆28Oct 2, 2025Updated 5 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX…☆88Mar 15, 2024Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆53Jun 6, 2025Updated 8 months ago
- A Self-Training Framework for Vision-Language Reasoning☆88Jan 23, 2025Updated last year
- ☆23Dec 8, 2022Updated 3 years ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆50Dec 15, 2023Updated 2 years ago
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State☆20Oct 24, 2025Updated 4 months ago
- ☆53Feb 11, 2025Updated last year
- ☆1,104Jan 10, 2026Updated last month
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models☆33Jun 9, 2024Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆125Mar 22, 2024Updated last year
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆64Nov 27, 2024Updated last year
- ☆30Sep 8, 2023Updated 2 years ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆26Aug 25, 2024Updated last year
- Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)☆32Jul 3, 2024Updated last year
- ☆29Apr 30, 2024Updated last year
- [ACL'25 Findings] Official repo for "HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task"☆37Apr 7, 2025Updated 10 months ago
- Plan✕ is a platform for creating and publishing digital planning services☆17Updated this week
- ☆215Feb 20, 2025Updated last year
- On Memorization of Large Language Models in Logical Reasoning☆74Mar 29, 2025Updated 11 months ago
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆77Oct 28, 2025Updated 4 months ago