Hritikbansal / jpo
☆12 Updated 4 months ago
Alternatives and similar repositories for jpo:
Users who are interested in jpo are comparing it to the repositories listed below.
- ☆29 Updated last year
- ☆25 Updated 2 years ago
- ☆15 Updated last year
- ☆15 Updated 3 weeks ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning… ☆12 Updated 2 years ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆44 Updated 3 weeks ago
- ☆41 Updated last year
- Mosaic IT: Enhancing Instruction Tuning with Data Mosaics ☆18 Updated 2 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆32 Updated 11 months ago
- Methods and evaluation for aligning language models temporally ☆29 Updated last year
- Align your LM to express calibrated verbal statements of confidence in its long-form generations. ☆24 Updated 11 months ago
- ☆16 Updated 9 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning ☆39 Updated last year
- ☆16 Updated 8 months ago
- Visual and Embodied Concepts evaluation benchmark ☆21 Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization" ☆13 Updated 10 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." ☆42 Updated 6 months ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach" ☆14 Updated 8 months ago
- ☆40 Updated last year
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization ☆25 Updated 3 months ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations ☆17 Updated 3 weeks ago
- Self-Supervised Alignment with Mutual Information ☆18 Updated 11 months ago
- ☆19 Updated 9 months ago
- Extending context length of visual language models ☆11 Updated 4 months ago
- Directional Preference Alignment ☆57 Updated 7 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433 ☆25 Updated 5 months ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut… ☆21 Updated 5 months ago
- Restore safety in fine-tuned language models through task arithmetic ☆28 Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning ☆21 Updated last year
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”. ☆26 Updated 2 years ago