PKU-Alignment / eval-anythingLinks
☆13Updated this week
Alternatives and similar repositories for eval-anything
Users that are interested in eval-anything are comparing it to the libraries listed below
Sorting:
- A comprehensive collection of process reward models.☆85Updated 2 weeks ago
- ☆100Updated last month
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆120Updated last week
- The official code repository for PRMBench.☆73Updated 3 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond☆228Updated this week
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆79Updated 9 months ago
- ☆56Updated 3 weeks ago
- A RLHF Infrastructure for Vision-Language Models☆176Updated 6 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆140Updated 3 months ago
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆92Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆54Updated 6 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆212Updated this week
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆31Updated 9 months ago
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆40Updated 2 weeks ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆96Updated last week
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆89Updated this week
- ☆151Updated this week
- repo for paper https://arxiv.org/abs/2504.13837☆144Updated last week
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆177Updated 4 months ago
- [ICML 2025] Official Implementation of GLIDER☆44Updated last week
- A Self-Training Framework for Vision-Language Reasoning☆80Updated 4 months ago
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation☆120Updated last year
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆250Updated last month
- The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usa…☆26Updated 2 months ago
- ICLR 2025 Agent-Related Papers☆71Updated 6 months ago
- The reinforcement learning codes for dataset SPA-VL☆33Updated 11 months ago
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆54Updated 10 months ago
- ☆57Updated this week
- Paper collections of multi-modal LLM for Math/STEM/Code.☆96Updated last week
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆160Updated 3 months ago