jacksonchen1998 / LLaMA-Paper-List
Collection of papers using LLaMA as backbone model
☆31 Updated 4 months ago
Alternatives and similar repositories for LLaMA-Paper-List:
Users who are interested in LLaMA-Paper-List compare it to the repositories listed below
- ☆72 Updated 3 months ago
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models" ☆53 Updated 4 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ☆69 Updated last year
- Direct Preference Optimization from scratch in PyTorch ☆74 Updated 11 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization ☆64 Updated 5 months ago
- ☆36 Updated last year
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models" ☆47 Updated last year
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation" ☆21 Updated 6 months ago
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks… ☆37 Updated 2 months ago
- Accepted LLM Papers in NeurIPS 2024 ☆33 Updated 3 months ago
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models ☆63 Updated last year
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024 ☆65 Updated 4 months ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations" ☆59 Updated last year
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment" ☆65 Updated last year
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models ☆86 Updated 5 months ago
- Continual Learning of Large Language Models: A Comprehensive Survey ☆323 Updated 3 weeks ago
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models ☆79 Updated 4 months ago
- DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems ☆18 Updated 4 months ago
- Notes and commented code for RLHF (PPO) ☆56 Updated 11 months ago
- ☆23 Updated last month
- Reference implementation for Token-level Direct Preference Optimization (TDPO) ☆126 Updated 6 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing". ☆66 Updated 2 weeks ago
- ☆28 Updated 7 months ago
- Domain-specific preference (DSP) data and customized RM fine-tuning. ☆24 Updated 10 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ☆39 Updated 3 months ago
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24… ☆16 Updated 7 months ago
- Using Explanations as a Tool for Advanced LLMs ☆57 Updated 4 months ago
- A Closer Look into Mixture-of-Experts in Large Language Models ☆42 Updated 5 months ago
- ☆15 Updated 10 months ago
- A Survey on the Honesty of Large Language Models ☆51 Updated last month