Victorwz / LaViA
☆10Updated 9 months ago
Alternatives and similar repositories for LaViA:
Users that are interested in LaViA are comparing it to the libraries listed below
- [ICML 2024] Self-Infilling Code Generation☆19Updated 11 months ago
- ☆12Updated 5 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆24Updated last year
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆25Updated last year
- ☆25Updated 2 years ago
- Extending context length of visual language models☆11Updated 4 months ago
- GenRM-CoT: Data release for verification rationales☆56Updated 6 months ago
- The code and data for the paper JiuZhang3.0☆43Updated 11 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆59Updated 9 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆28Updated 9 months ago
- Methods and evaluation for aligning language models temporally☆29Updated last year
- Directional Preference Alignment☆57Updated 7 months ago
- ☆44Updated 2 years ago
- ☆30Updated 7 months ago
- ☆33Updated last month
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆24Updated 8 months ago
- ☆15Updated last year
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆16Updated 3 months ago
- Self-Supervised Alignment with Mutual Information☆17Updated 11 months ago
- my commonly-used tools☆52Updated 3 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆43Updated last week
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆29Updated 5 months ago
- Visual and Embodied Concepts evaluation benchmark☆21Updated last year
- ☆40Updated last year
- ☆95Updated last year
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆51Updated 5 months ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆26Updated last year
- ☆13Updated 2 years ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆26Updated 2 years ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆38Updated last year