holarissun / embedding-based-llm-alignmentLinks
Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs
☆16Updated last month
Alternatives and similar repositories for embedding-based-llm-alignment
Users that are interested in embedding-based-llm-alignment are comparing it to the libraries listed below
Sorting:
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆16Updated 4 months ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆26Updated last year
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆58Updated 2 months ago
- The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".☆17Updated 11 months ago
- ☆30Updated last year
- ☆49Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Updated last year
- Evaluate the Quality of Critique☆35Updated last year
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆25Updated last year
- ☆32Updated last year
- Teaching Models to Express Their Uncertainty in Words☆39Updated 3 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆12Updated 11 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆46Updated last year
- Explore what LLMs are really leanring over SFT☆28Updated last year
- ☆48Updated 3 weeks ago
- ☆12Updated 11 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆60Updated 2 years ago
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"☆39Updated 2 weeks ago
- Learning adapter weights from task descriptions☆18Updated last year
- ☆40Updated last year
- In-context Example Selection with Influences☆15Updated 2 years ago
- Self-Supervised Alignment with Mutual Information☆19Updated last year
- Personality Alignment of Language Models☆37Updated 2 months ago
- ☆97Updated last year
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆13Updated 8 months ago
- ☆74Updated last year
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆25Updated 6 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆45Updated 7 months ago
- ☆50Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆76Updated last year