janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆16Updated 7 months ago
Alternatives and similar repositories for sami:
Users that are interested in sami are comparing it to the libraries listed below
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆41Updated 5 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs?☆26Updated 7 months ago
- Directional Preference Alignment☆54Updated 3 months ago
- ☆26Updated last year
- ☆27Updated 10 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆25Updated 9 months ago
- ☆20Updated 7 months ago
- Dateset Reset Policy Optimization☆28Updated 9 months ago
- ☆21Updated 4 months ago
- Lightweight Adapting for Black-Box Large Language Models☆19Updated 11 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆15Updated 6 months ago
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re…☆22Updated 3 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated 10 months ago
- ☆15Updated 5 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆49Updated 7 months ago
- ☆15Updated 2 months ago
- ☆16Updated 6 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆42Updated last month
- ☆12Updated 3 weeks ago
- Long Context Extension and Generalization in LLMs☆40Updated 3 months ago
- Codebase for Instruction Following without Instruction Tuning☆33Updated 3 months ago
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆24Updated 10 months ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆11Updated 4 months ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆22Updated last month
- ☆44Updated last year
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards☆49Updated 8 months ago
- Evaluate the Quality of Critique☆35Updated 7 months ago
- ☆34Updated 11 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆24Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 4 months ago