Open-Social-World / autolibra
AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback
☆16 · Updated last month
Alternatives and similar repositories for autolibra
Users who are interested in autolibra are comparing it to the repositories listed below.
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective ☆40 · Updated 2 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World". ☆27 · Updated 3 months ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives". ☆29 · Updated last year
- ☆13 · Updated 5 months ago
- ☆20 · Updated last month
- DataSciBench: An LLM Agent Benchmark for Data Science ☆43 · Updated 3 months ago
- ☆25 · Updated 8 months ago
- ☆34 · Updated 7 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ☆49 · Updated last year
- ☆72 · Updated last month
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆45 · Updated 7 months ago
- ☆19 · Updated last year
- ☆17 · Updated 4 months ago
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling' ☆26 · Updated 6 months ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization ☆31 · Updated 10 months ago
- Official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and… ☆70 · Updated 8 months ago
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization" ☆18 · Updated last year
- ☆51 · Updated 4 months ago
- ☆10 · Updated 2 years ago
- ☆30 · Updated last year
- ☆15 · Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization" ☆13 · Updated last year
- ☆46 · Updated 2 months ago
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models ☆27 · Updated last year
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models) ☆48 · Updated 6 months ago
- This is the official repo for Towards Uncertainty-Aware Language Agent. ☆30 · Updated last year
- Reinforced Multi-LLM Agents training ☆60 · Updated 6 months ago
- ☆38 · Updated 3 months ago
- ☆30 · Updated last year
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization" ☆22 · Updated last year