jihoontack / MAC
Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)
☆63 Updated 10 months ago
Alternatives and similar repositories for MAC
Users that are interested in MAC are comparing it to the libraries listed below
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasonin… ☆51 Updated last year
- Directional Preference Alignment ☆56 Updated 8 months ago
- ☆85 Updated last year
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023). ☆16 Updated 4 months ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging ☆106 Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆44 Updated last month
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024) ☆53 Updated 6 months ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?" ☆78 Updated 2 weeks ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning" ☆78 Updated last year
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025) ☆12 Updated 2 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment ☆55 Updated 11 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models ☆55 Updated 3 months ago
- ☆94 Updated last year
- Domain-specific preference (DSP) data and customized RM fine-tuning. ☆25 Updated last year
- Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024). ☆40 Updated 9 months ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment" ☆73 Updated 2 weeks ago
- Lightweight Adapting for Black-Box Large Language Models ☆22 Updated last year
- This is the official repo for Towards Uncertainty-Aware Language Agent. ☆25 Updated 9 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆65 Updated last year
- ☆40 Updated last year
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆120 Updated 8 months ago
- ☆97 Updated 11 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆47 Updated 6 months ago
- Self-Supervised Alignment with Mutual Information ☆18 Updated last year
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment" ☆69 Updated last year
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou… ☆29 Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆54 Updated last year
- This is the official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022) ☆100 Updated 2 years ago
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re… ☆31 Updated 8 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆32 Updated last year