wenzhe-li / Self-MoA
☆18 Updated last month
Alternatives and similar repositories for Self-MoA:
Users who are interested in Self-MoA are comparing it to the repositories listed below.
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment ☆53 Updated 9 months ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach" ☆12 Updated 7 months ago
- ☆30 Updated 2 months ago
- ☆13 Updated last year
- Official implementation of paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https://arxiv.or… ☆22 Updated last month
- ☆14 Updated 4 months ago
- ☆18 Updated 8 months ago
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re… ☆26 Updated 6 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆26 Updated 11 months ago
- Code for "Reasoning to Learn from Latent Thoughts" ☆51 Updated this week
- Efficient Scaling laws and collaborative pretraining. ☆15 Updated 2 months ago
- Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference" ☆14 Updated last month
- Self-Supervised Alignment with Mutual Information ☆16 Updated 10 months ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models) ☆16 Updated 3 weeks ago
- Official implementation of paper "Process Reward Model with Q-value Rankings" ☆51 Updated last month
- Long Context Extension and Generalization in LLMs ☆50 Updated 6 months ago
- ☆24 Updated 7 months ago
- ☆15 Updated 9 months ago
- ☆59 Updated last week
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?" ☆44 Updated this week
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models ☆18 Updated 8 months ago
- ☆16 Updated last month
- ☆25 Updated 7 months ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner ☆22 Updated 9 months ago
- ☆21 Updated 5 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆52 Updated last year
- ☆21 Updated 8 months ago
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral) ☆75 Updated 5 months ago
- Official implementation of AAAI 2025 paper "Augmenting Math Word Problems via Iterative Question Composing" (https://arxiv.org/abs/2401.09… ☆19 Updated 3 months ago
- Directional Preference Alignment ☆56 Updated 6 months ago