Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18Oct 5, 2024Updated last year
Alternatives and similar repositories for Q-Adapter
Users that are interested in Q-Adapter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python module designed for agile RL algorithm developing.☆26Jul 11, 2024Updated last year
- ☆14Mar 5, 2024Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated last year
- ☆33Aug 30, 2024Updated last year
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value☆35Mar 29, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Format your bibtex (.bib) file to help standardize citations for conference and journal submissions