holarissun / embedding-based-llm-alignment
Codebase for the paper "Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs"
☆21Updated 8 months ago
Alternatives and similar repositories for embedding-based-llm-alignment
Users who are interested in embedding-based-llm-alignment are comparing it to the repositories listed below
- Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆70Updated 9 months ago
- ☆51Updated 2 years ago
- Links to publications that focus on the interpretation and analysis of in-context learning☆12Updated last year
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Updated last year
- The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".☆17Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆65Updated 2 years ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Updated last year
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆80Updated 2 years ago
- ☆52Updated 9 months ago
- Teaching Models to Express Their Uncertainty in Words☆39Updated 3 years ago
- Official code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…☆75Updated last year
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models☆83Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆132Updated 2 years ago
- ☆47Updated 9 months ago
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆25Updated last year
- GenRM-CoT: Data release for verification rationales☆68Updated last year
- Code for ACL2024 paper - Adversarial Preference Optimization (APO).☆56Updated last year
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆99Updated 4 years ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆148Updated 2 years ago
- ☆61Updated 7 months ago
- An index of algorithms for reinforcement learning from human feedback (RLHF)☆92Updated last year
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models☆117Updated 5 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆50Updated last year
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆57Updated last year
- Explore what LLMs are really learning over SFT☆28Updated last year
- ☆57Updated 7 months ago
- ☆103Updated 2 years ago
- [ACL 2024 main] Aligning Large Language Models with Human Preferences through Representation Engineering (https://aclanthology.org/2024.…☆28Updated last year
- ☆46Updated last year
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆94Updated last year