Victorwz / LaViA
β10Updated 4 months ago
Related projects β
Alternatives and complementary repositories for LaViA
- Methods and evaluation for aligning language models temporallyβ24Updated 8 months ago
- πΎ OAT: Online AlignmenT for LLMsβ27Updated this week
- β24Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoningβ20Updated 8 months ago
- β41Updated last year
- Domain-specific preference (DSP) data and customized RM fine-tuning.β24Updated 8 months ago
- Directional Preference Alignmentβ49Updated last month
- β85Updated 11 months ago
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".β61Updated last year
- Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"β55Updated last year
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinismβ25Updated 3 months ago
- β27Updated 8 months ago
- β46Updated 10 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMsβ63Updated last year
- Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimizationβ13Updated 3 weeks ago
- β20Updated 4 months ago
- β18Updated 2 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)β45Updated 7 months ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).β14Updated last year
- β35Updated 9 months ago
- [ICML 2024] Self-Infilling Code Generationβ18Updated 6 months ago
- β37Updated 7 months ago
- Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Leβ¦β68Updated 7 months ago
- β40Updated 11 months ago
- Analyzing LLM Alignment via Token distribution shiftβ13Updated 9 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$β29Updated 3 weeks ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervisionβ95Updated 2 months ago
- Restore safety in fine-tuned language models through task arithmeticβ26Updated 7 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.β46Updated 4 months ago