wenzhe-li / Self-MoA
☆18Updated 2 months ago
Alternatives and similar repositories for Self-MoA:
Users that are interested in Self-MoA are comparing it to the libraries listed below
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆54Updated 10 months ago
- ☆31Updated 3 months ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆17Updated 2 weeks ago
- ☆18Updated 9 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆10Updated 3 weeks ago
- Official implementation of AAAI 2025 paper "Augmenting Math Word Problems via Iterative Question Composing"(https://arxiv.org/abs/2401.09…☆20Updated 4 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆29Updated 2 months ago
- Self-Supervised Alignment with Mutual Information☆16Updated 10 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆19Updated 9 months ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆14Updated 7 months ago
- ☆13Updated last year
- Efficient Scaling laws and collaborative pretraining.☆16Updated 2 months ago
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"☆39Updated 6 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆26Updated last year
- ☆15Updated last week
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Updated 8 months ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆35Updated 5 months ago
- Benchmarking Benchmark Leakage in Large Language Models☆51Updated 11 months ago
- Exploration of automated dataset selection approaches at large scales.☆38Updated last month
- Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆16Updated 2 months ago
- The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agen…☆26Updated last year
- Evaluate the Quality of Critique☆34Updated 10 months ago
- Lightweight Adapting for Black-Box Large Language Models☆22Updated last year
- ☆18Updated 3 weeks ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆66Updated last month
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re…☆29Updated 7 months ago
- ☆27Updated last year
- ☆35Updated last year
- ☆18Updated 11 months ago