jacksonchen1998 / LLaMA-Paper-ListLinks
Collection of papers using LLaMA as backbone model
☆40Updated 2 months ago
Alternatives and similar repositories for LLaMA-Paper-List
Users that are interested in LLaMA-Paper-List are comparing it to the libraries listed below
Sorting:
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆72Updated 2 years ago
- Official PyTorch implementation of DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs (ICML 2025 Oral)☆23Updated 2 weeks ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆52Updated 2 years ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆52Updated 4 months ago
- ☆30Updated 4 months ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆45Updated 11 months ago
- A Sober Look at Language Model Reasoning☆74Updated last week
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"☆39Updated last month
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models☆75Updated last year
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"☆47Updated last month
- ☆16Updated 3 months ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆19Updated 8 months ago
- Long Context Extension and Generalization in LLMs☆57Updated 9 months ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆62Updated 2 months ago
- ☆40Updated last year
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆81Updated last year
- ☆109Updated 3 months ago
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"☆59Updated 8 months ago
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆69Updated last year
- This package implements THOR: Transformer with Stochastic Experts.☆65Updated 3 years ago
- Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts"☆47Updated 2 years ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆61Updated last year
- Bayesian Low-Rank Adaptation for Large Language Models☆34Updated last year
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆69Updated last month
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Updated 2 months ago
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…☆85Updated 6 months ago
- Direct Preference Optimization from scratch in PyTorch☆98Updated 2 months ago
- ☆15Updated 8 months ago
- ☆155Updated 3 years ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆143Updated last year