Victorwz / LaViALinks
☆10Updated last year
Alternatives and similar repositories for LaViA
Users that are interested in LaViA are comparing it to the libraries listed below
Sorting:
- [ICML 2024] Self-Infilling Code Generation☆18Updated last year
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆123Updated 11 months ago
- Directional Preference Alignment☆59Updated 10 months ago
- ☆14Updated last month
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆25Updated last year
- ☆100Updated last year
- GenRM-CoT: Data release for verification rationales☆63Updated 9 months ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆29Updated 6 months ago
- ☆30Updated last year
- ☆27Updated 2 years ago
- Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆85Updated last month
- Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…☆75Updated last year
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Updated last year
- ☆45Updated 2 years ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆58Updated last year
- ☆48Updated 9 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆114Updated 4 months ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆27Updated 11 months ago
- Pitfalls of Rule- and Model-based Verifiers: A Case Study on Mathematical Reasoning.☆22Updated 2 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆44Updated 3 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆140Updated 10 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆58Updated 8 months ago
- Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"☆56Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆27Updated last year
- ☆34Updated last year
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆77Updated 2 years ago
- ☆33Updated 10 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆62Updated last year
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆36Updated 2 months ago
- ☆43Updated last year